Technical Field

The present invention relates to new isolates 
of the viral class Hepatitis C, polypeptides, 
polynucleotides and antibodies derived therefrom, as well 
as the use of such polypeptides, polynucleotides and 
antibodies in assays (e.g., immunoassays, nucleic acid 
hybridization assays, etc.) and in the production of 
viral polypeptides. 

Background 

Non-A, Non-B hepatitis (NANBH) is a
transmissible disease or family of diseases that are
believed to be viral-induced, and that are
distinguishable from other forms of viral-associated
liver diseases, including that caused by the known
hepatitis viruses, i.e., hepatitis A virus (HAV),
hepatitis B virus (HBV), and delta hepatitis virus (HDV),
as well as the hepatitis induced by cytomegalovirus (CMV)
or Epstein-Barr virus (EBV). NANBH was first identified
in transfused individuals. Transmission from man to
chimpanzee and serial passage in chimpanzees provided
evidence that NANBH is due to a transmissible infectious
agent or agents. Epidemiologic evidence suggests
that there may be three types of NANBH: the water-borne
epidemic type; the blood or needle associated type; and
the sporadically occurring (community acquired) type.
However, until recently, no transmissible agent
responsible for NANBH had been identified.

Clinical diagnosis and identification of NANBH
has been accomplished primarily by exclusion of other
viral markers. Among the methods used to detect putative
NANBH antigens and antibodies are agar-gel diffusion,
counterimmunoelectrophoresis, immunofluorescence
microscopy, immune electron microscopy, radioimmunoassay,
and enzyme-linked immunosorbent assay. However, none of
these assays has proved to be sufficiently sensitive,
specific, and reproducible to be used as a diagnostic
test for NANBH.

Until recently there has been neither clarity
nor agreement as to the identity or specificity of the
antigen-antibody systems associated with agents of NANBH.
It is possible that NANBH is caused by more than one
infectious agent, and it is unclear what the serological
assays detect in the serum of patients with NANBH.

In the past, a number of candidate NANBH agents
were postulated. See, e.g., Prince (1983) Ann. Rev.
Microbiol. 37:217; Feinstone & Hoofnagle (1984) New Eng.
J. Med. 311:185; Overby (1985) Curr. Hepatol. 5:49; Overby
(1986) Curr. Hepatol. 6:65; Overby (1987) Curr. Hepatol.
7:35; and Iwarson (1987) British Med. J. 295:946.
However, there is no proof that any of these candidates
represents the etiological agent of NANBH.

In 1987, Houghton et al. cloned the first virus
definitively linked to NANBH. See, e.g., EPO Pub. No.
318,216; Houghton et al., Science 244:359 (1989).
Houghton et al. described therein the cloning of an
isolate from a new viral class, hepatitis C virus (HCV),
the prototype isolate described therein being named
"HCV1". HCV is a Flavi-like virus, with an RNA genome.
Houghton et al. described the production of recombinant
proteins from HCV sequences that are useful as diagnostic
reagents, as well as polynucleotides useful in diagnostic
hybridization assays and in the cloning of additional HCV
isolates.

The demand for sensitive, specific methods for
screening and identifying carriers of NANBH and
NANBH-contaminated blood or blood products is significant.
Post-transfusion hepatitis (PTH) occurs in approximately
10% of transfused patients, and NANBH accounts for up to
90% of these cases. There is a frequent progression to
chronic liver damage (25-55%).

Patient care as well as the prevention of
transmission of NANBH by blood and blood products or by
close personal contact require reliable diagnostic and
prognostic tools to detect nucleic acids, antigens and
antibodies related to NANBH. In addition, there is
a need for effective vaccines and immunotherapeutic
agents for the prevention and/or treatment of
the disease.

While at least one HCV isolate has been
identified which is useful in meeting the above needs,
additional isolates, particularly those with a divergent
genome, may prove to have unique applications.


Summary of the Invention 

New isolates of HCV have been characterized from
Japanese blood donors who have been implicated as NANBH
carriers. These isolates exhibit nucleotide and amino
acid sequence heterogeneity with respect to the prototype
isolate, HCV1, in several viral domains. It is believed
that these distinct sequences are of importance,
particularly in diagnostic assays and in vaccine
development.

In one embodiment, the present invention
provides a DNA molecule comprising a nucleotide sequence
of at least 15 bp from an HCV isolate substantially
homologous to an isolate selected from the group J1 or
J7, wherein said nucleotide sequence is distinct from the
nucleotide sequence of HCV isolate HCV1.

In another embodiment, the present invention
provides a DNA molecule comprising a nucleotide sequence
of at least 15 bp encoding an amino acid sequence from an
HCV isolate J1 or J7 wherein the J1 or J7 amino acid
sequence is distinct from the amino acid sequence of HCV
isolate HCV1.

Yet another embodiment of the present invention
provides a purified polypeptide comprising an amino acid
sequence from an HCV isolate substantially homologous to
an isolate selected from the group J1 and J7, wherein
said amino acid sequence is distinct from the sequence of
the polypeptides encoded by the HCV isolate HCV1.

Still another embodiment of the present
invention provides a polypeptide comprising an amino acid
sequence from an HCV isolate J1 or J7 wherein the J1 or J7
amino acid sequence is distinct from the amino acid
sequence of HCV isolate HCV1 and the polypeptide is
immobilized on a solid support.

In a further embodiment of the present
invention, an immunoassay for detecting the presence of
anti-HCV antibodies in a test sample is provided
comprising: (a) incubating the test sample under
conditions that allow the formation of antigen-antibody
complexes with an immunogenic polypeptide comprising an
amino acid sequence from an HCV isolate substantially
homologous to an isolate selected from the group J1 and
J7, wherein the amino acid sequence is distinct from the
amino acid sequence of HCV isolate HCV1; and (b)
detecting an antigen-antibody complex comprising the
immunogenic polypeptide.

The present invention also provides a
composition comprising anti-HCV antibodies that bind an
HCV epitope substantially free of antibodies that do not
bind an HCV epitope, wherein: (a) the HCV epitope
comprises an amino acid sequence from an HCV isolate
substantially homologous to an isolate selected from the
group J1 and J7, wherein the amino acid sequence is
distinct from the amino acid sequence of HCV isolate
HCV1; and (b) the J1 or J7 amino acid sequence is not
immunologically cross-reactive with HCV1.

A further embodiment of the present invention
provides an immunoassay for detecting the presence of an
HCV polypeptide in a test sample comprising: (a)
incubating the test sample under conditions that allow
the formation of antigen-antibody complexes with anti-HCV
antibodies that bind an HCV epitope wherein: (i) the
HCV epitope comprises an amino acid sequence from an HCV
isolate J1 or J7; (ii) the J1 or J7 amino acid sequence
is distinct from the amino acid sequence of HCV isolate
HCV1; and (iii) the J1 or J7 amino acid sequence is not
immunologically cross-reactive with HCV1; and (b)
detecting an antigen-antibody complex comprising the
anti-HCV antibodies.

Also provided by the present invention is a
method of producing anti-HCV antibodies comprising
administering to a mammal a polypeptide comprising an
amino acid sequence from an HCV isolate J1 or J7 wherein
the J1 or J7 amino acid sequence is distinct from the
amino acid sequence of HCV isolate HCV1 whereby the
mammal produces anti-HCV antibodies.

Yet another embodiment of the present invention
provides a method of detecting HCV polynucleotides in a
test sample comprising: (a) providing a probe comprising
the DNA molecule of claim 1; (b) contacting the test
sample and the probe under conditions that allow for the
formation of a polynucleotide duplex between the probe
and its complement in the absence of substantial
polynucleotide duplex formation between the probe and
non-HCV polynucleotide sequences present in the test
sample; and (c) detecting any polynucleotide duplexes
comprising the probe.

A still further embodiment of the present
invention provides a method of producing a recombinant
polypeptide comprising an HCV amino acid sequence, the
method comprising: (a) providing host cells transformed
by a DNA construct comprising control sequences for the
host cell operably linked to a coding sequence encoding
an amino acid sequence from an HCV isolate J1 or J7
wherein the J1 or J7 amino acid sequence is distinct from
the amino acid sequence of HCV isolate HCV1; (b) growing
the host cells under conditions whereby the coding
sequence is transcribed and translated into the recombinant
polypeptide; and (c) recovering the recombinant
polypeptide.

These and other embodiments of the present
invention will be readily apparent to those of ordinary
skill in the art in view of the following description.



Brief Description of the Figures

Figure 1 shows the consensus sequence of the
coding strand of a fragment from the J7 C/E domain with
the heterogeneities.

Figure 2 shows the consensus sequence of the
coding strand of a fragment from the J1 E domain with the
heterogeneities.

Figure 3 shows the consensus sequence of the
coding strand of a fragment of the J1 E/NS1 domain with
the heterogeneities.

Figure 4 shows the consensus sequence of the
coding strand of a fragment from the J1 NS3 domain with
the heterogeneities.

Figure 5 shows the consensus sequence of the
coding strand of a fragment from the J1 NS5 domain with
the heterogeneities.

Figure 6 shows the homology of the J7 C/E
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 7 shows the homology of the J1 E
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 8 shows the homology of the J1 E/NS1
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 9 shows the homology of the J1 NS3
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 10 shows the homology of the J1 NS5
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 11 shows the putative genomic
organization of the HCV1 genome.

Figure 12 shows the nucleotide sequence of the
ORF of HCV1. In the figure, nucleotide number 1 is the
first A of the putative initiating methionine of the
large ORF; nucleotides upstream of this nucleotide are
numbered with negative numbers.

Figure 13 shows the consensus sequence of the
coding strand of a fragment from the J1 NS1 domain (J1
1519) with the nucleotide sequence of the same domain
from HCV1. Also shown are the amino acids encoded
therein.

WO 91/04262 PCT/US90/05242

Figure 14 shows a composite of the consensus
sequence from the core to the NS1 domain of J1 with the
nucleotide sequence of the same domain from HCV1. Also
shown are the amino acids encoded therein.

Figure 15 shows a consensus sequence of the
coding strand of the NS1 domain of J1, as determined in
Example IV. Also shown are the nucleotide sequence of
the same domain from HCV1, and the amino acids encoded in
the HCV1 and J1 sequences.

Figure 16 shows a consensus sequence of a
coding strand of the C200 region of the NS3-NS4 domain of
J1. Also shown are the nucleotide sequence of the same
domain from HCV1, and the amino acids encoded
in the sequences.

Figure 17 shows a consensus sequence of the
coding strand of the NS1 domain of J1, as determined in
Example V. Also shown are the nucleotide sequence of the
same domain from HCV1, and the amino acids encoded in the
sequences.

Figure 18 shows a consensus sequence of the
coding strand of the untranslated and core domains of J1.
Also shown are the nucleotide sequence of the same domain
from HCV1, and the amino acids encoded in the sequences.



Detailed Description of the Invention

The practice of the present invention will
employ, unless otherwise indicated, conventional
techniques of molecular biology, microbiology,
recombinant DNA techniques, and immunology, which are
within the skill of the art. Such techniques are
explained fully in the literature. See, e.g., Maniatis,
Fritsch & Sambrook, MOLECULAR CLONING: A LABORATORY MANUAL
(1982); DNA CLONING, VOLUMES I AND II [...];
NUCLEIC ACID HYBRIDIZATION (B.D. Hames [...] eds. 198[...]);
[...] CELL CULTURE (R.I. Freshney ed. 1986);
IMMOBILIZED CELLS AND ENZYMES (IRL [...]);
[...] A PRACTICAL GUIDE TO MOLECULAR
CLONING (1984); the series, METHODS IN ENZYMOLOGY
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR
MAMMALIAN CELLS (J.H. Miller and M.P. [...]);
Methods in Enzymology Vol. 154 and Vol. 155
([...] Grossman, and Wu, eds., respectively);
Mayer and Walker, eds. [...],
IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY
[...] hereby incorporated herein by reference.

[...] by workers in [...]. Accordingly, as used herein,
"hepatitis C virus" (HCV) refers to an agent causative of
NANBH, which was formerly referred to as NANBV and/or
BB-NANBV. [...] the prototype isolate, HCV1,
[...] Houghton et al. See, [...]
[...] available in [...] applications claiming
priority therefrom, the disclosures of which are
incorporated herein by reference. The nucleotide
sequence and putative amino acid sequence of HCV1 is
[...]. The terms HCV, NANBV, and BB-NANBV
are used interchangeably herein. As an extension of this
terminology, the disease caused by HCV, formerly called
NANB hepatitis (NANBH), is called hepatitis C. The terms
NANBH and hepatitis C may be used interchangeably
herein. The term "HCV", as used herein, denotes a viral
species of which pathogenic strains cause NANBH, as well
as attenuated strains or defective interfering particles
derived therefrom.

HCV is a Flavi-like virus. The morphology and
composition of Flavivirus particles are known, and are
discussed by Brinton (1986) THE VIRUSES: THE TOGAVIRIDAE
AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner,
vol. eds. Schlesinger and Schlesinger, Plenum Press),
p. 327-374. Generally, with respect to morphology,
Flaviviruses contain a central nucleocapsid surrounded by
a lipid bilayer. Virions are spherical and have a
diameter of about 40-50 nm. Their cores are about 25-30
nm in diameter. Along the outer surface of the virion
envelope are projections that are about 5-10 nm long with
terminal knobs about 2 nm in diameter.

The HCV genome is comprised of RNA. It is
known that RNA-containing viruses have relatively high
rates of spontaneous mutation, i.e., reportedly on the
order of $10^{-3}$ to $10^{-4}$ per incorporated nucleotide.
Therefore, there are multiple strains,
which may be virulent or avirulent, within the HCV class
or species.

It is believed that the genome of HCV isolates
is comprised of a single ORF of approximately 9,000
nucleotides to approximately 12,000 nucleotides, encoding
a polyprotein similar in size to that of HCV1, an encoded
polyprotein of similar hydrophobic and antigenic
character to that of HCV1, and the presence of co-linear
peptide sequences that are conserved with HCV1. In
addition, the genome is believed to be a
positive-stranded RNA.
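
To give a feel for what the reported mutation rate implies at this genome size, the following is a minimal arithmetic sketch. It assumes (these are simplifications, not statements from this description) that changes are independent and uniformly distributed, and it uses the ORF length as a stand-in for the genome length. The function names are hypothetical.

```python
def expected_mutations(genome_length_nt, rate_per_nt):
    """Expected number of nucleotide changes in one copy of the genome,
    assuming independent, uniformly distributed misincorporations."""
    return genome_length_nt * rate_per_nt


def estimate_table(lengths=(9_000, 12_000), rates=(1e-3, 1e-4)):
    """Combine the ORF lengths and mutation rates quoted in the text."""
    return {(n, r): expected_mutations(n, r) for n in lengths for r in rates}


if __name__ == "__main__":
    for (n, r), m in sorted(estimate_table().items()):
        print(f"{n} nt at {r:g} per nt: about {m:.1f} changes per copy")
```

Under these assumptions, each copy of the genome would carry roughly one to a dozen changes, which is consistent with the expectation of multiple strains within the HCV class.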
Isolates of HCV comprise epitopes that are
immunologically cross-reactive with epitopes in the HCV1
genome. At least some of these are epitopes unique to
HCV when compared to other known Flaviviruses. The
uniqueness of the epitope may be determined by its
immunological reactivity with anti-HCV antibodies and
lack of immunological reactivity with antibodies to other
Flavivirus species. Methods for determining
immunological reactivity are known in the art, for
example, by radioimmunoassay, by ELISA assay, by
hemagglutination, and several examples of suitable
techniques for assays are provided herein.

It is also expected that the overall homology
of HCV isolates and HCV1 genomes at the nucleotide level
probably will be about 40% or greater, probably about 60%
or greater, and even more probably about 80% to about 90%
or greater. In addition, it is expected that there are many
corresponding contiguous sequences of at least about 13
nucleotides that are fully homologous. The correspondence
between the sequence from a new isolate and the HCV1
sequence can be determined by techniques known in the art.
For example, it can be determined by a direct comparison of
the sequence information of the polynucleotide from the
new isolate and HCV1 sequences. Alternatively, homology
can be determined by hybridization of the polynucleotides
under conditions which form stable duplexes between
homologous regions (for example, those which would be
used prior to S1 digestion), followed by digestion with
single-stranded specific nuclease(s), followed by size
determination of the digested fragments.
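
The direct-comparison route described above can be sketched in a few lines. This is an illustrative sketch only: it assumes the two sequences have already been aligned to equal length by some other tool, it counts gap characters as mismatches, and the function names and example sequences are hypothetical rather than taken from the figures.

```python
def percent_identity(a, b):
    """Percentage of aligned positions at which two sequences agree.

    Both inputs must already be aligned to the same length; a gap
    character ('-') is never counted as a match.
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be pre-aligned, equal length, non-empty")
    matches = sum(1 for x, y in zip(a.upper(), b.upper()) if x == y and x != "-")
    return 100.0 * matches / len(a)


def longest_identical_run(a, b):
    """Length of the longest stretch of contiguous identical positions."""
    best = run = 0
    for x, y in zip(a.upper(), b.upper()):
        run = run + 1 if (x == y and x != "-") else 0
        best = max(best, run)
    return best


if __name__ == "__main__":
    new_isolate = "ACGTACGT"   # hypothetical fragment
    prototype = "ACGTTCGT"     # hypothetical fragment
    print(percent_identity(new_isolate, prototype))
    print(longest_identical_run(new_isolate, prototype))
```

A result from `longest_identical_run` of 13 or more would correspond to the "contiguous sequences of at least about 13 nucleotides that are fully homologous" mentioned in the text.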

Because of the evolutionary relationship of the
strains or isolates of HCV, putative HCV strains or
isolates are identifiable by their homology at the
polypeptide level. Thus, new HCV isolates are expected
to be more than about 40% homologous, probably more than
[...] homologous at the polypeptide level. [...]
for determining amino acid sequence homology are known in
the art. For example, the amino acid sequence may be
determined directly and compared to the sequences
provided herein. Alternatively, the nucleotide sequence
of the genomic material of the putative HCV may be
determined [...] and the corresponding regions compared.

The ORF of HCV1 is shown in Figure 12. The
non-structural, core, and envelope domains of the
polyprotein have been predicted for HCV1 (Figure [...]).
The [...] is believed to be encoded from
the 5' terminus to about nucleotide 345 of HCV1. The
putative "E", or envelope, domain of HCV1 is thought to
be encoded from about nucleotide [...] to about
nucleotide [...]. [...] from about nucleotide 1051 to about
nucleotide 1953. For the remaining domains, putative NS2
is thought to be encoded from about nucleotide 1954 to
about nucleotide 3018, putative NS3 from about nucleotide
3019 to about nucleotide [...], putative [...], and putative
NS5 from about nucleotide 629[...] to the 3' terminus
[...]. The above boundaries are approximations [...]
of the ORF. The exact boundaries
can be determined by those skilled in the art in view of
the disclosure herein.

[...] new HCV isolates characterized by the nucleotide [...]
disclosed herein, as well as [...]
substantially homologous thereto; i.e., [...]
[...]0% or about 95% at the nucleotide [...]
that the sequences disclosed herein characterize [...]
subclass that is predominant in Japan and other Asian
and/or Pacific rim countries. Additional J1 and J7
isolates can be obtained in view of the disclosure herein
and EPO Pub. No. 318,216. In particular, the J1 and J7
nucleotide sequences disclosed herein, as well as the
HCV1 sequences in Figure 12, can be used as primers or
probes to clone additional domains of J1, J7, or
additional isolates.

As used herein, a nucleotide sequence "from" a
designated sequence or source refers to a nucleotide
sequence that is homologous (i.e., identical) to or
complementary to the designated sequence or source, or a
portion thereof. The J1 sequences provided herein are a
minimum of about 6 nucleotides, preferably about 8
nucleotides, more preferably about 15 nucleotides, and
most preferably 20 nucleotides or longer. The maximum
length is the complete viral genome.
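
The definition above lends itself to a simple check: a candidate is "from" a source if it, or its reverse complement, occurs within the source and meets a chosen minimum length. The sketch below is one illustrative way to express that; the function names, the default threshold, and the example sequence are assumptions made here for illustration, and only unambiguous DNA bases (A, C, G, T) are handled.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence made of A, C, G, T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_from(candidate, source, min_length=15):
    """True if `candidate` is identical to, or complementary to, a portion
    of `source` and is at least `min_length` nucleotides long.

    The text lists thresholds of about 6, 8, 15, and 20 nucleotides;
    the default of 15 is an arbitrary choice among them.
    """
    cand, src = candidate.upper(), source.upper()
    if len(cand) < min_length:
        return False
    return cand in src or reverse_complement(cand) in src


if __name__ == "__main__":
    source = "ATGGCGTACCTGAAGCTTGGATCC"   # hypothetical sequence
    probe = source[3:20]                   # 17-nt portion of the source
    print(is_from(probe, source))
    print(is_from(reverse_complement(probe), source))
```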

In some aspects of the invention, the sequence
of the region from which the polynucleotide is derived is
preferably homologous to or complementary to a sequence
which is unique to an HCV genome or the J1 and J7 genomes.
Whether or not a sequence is unique to a genome can be
determined by techniques known to those of skill in the
art. For example, the sequence can be compared to
sequences in databanks, e.g., GenBank, to determine
whether it is present in the uninfected host or other
organisms. The sequence can also be compared to the
known sequences of other viral agents, including those
which are known to induce hepatitis, e.g., HAV, HBV, and
HDV, and to other members of the Flaviviridae. The
correspondence or non-correspondence of the derived
sequence to other sequences can also be determined by
hybridization under the appropriate stringency
conditions. Hybridization techniques for determining the
complementarity of nucleic acid sequences are known in
the art. [...] Harbor Press, Cold Spring Harbor [...]
including, for example, digestion with a nuclease [...]
S1 that specifically digests single-stranded [...]
non-transcribed and/or non-translated regions.

[...] nucleotide sequence shown,
but may be generated [...] chemical [...] replication [...]
transcription or transcription [...]

An amino acid sequence "from" a designated
polypeptide or source of polypeptides means that the
amino acid sequence is homologous (i.e., identical) to
the sequence of [...], or a portion
thereof. [...] amino acid [...]
nucleic acid [...]
amino acid sequence [...] polypeptide
encoded in the sequence, or a portion thereof. The J1 [...]
amino acid sequences [...]
present invention [...]
preferably at least 2[...]
The polypeptides of the present invention are
not necessarily translated from a designated nucleic acid
sequence; the polypeptides may be generated in any
manner, including for example, chemical synthesis, or
expression of a recombinant expression system, or
isolation from virus. The polypeptides may include one
or more analogs of amino acids or unnatural amino acids.
Methods of inserting analogs of amino acids into a
sequence are known in the art. The polypeptides may also
include one or more labels, which are known to those of
skill in the art.

The term "recombinant polynucleotide" as used
herein intends a polynucleotide of genomic, cDNA,
semisynthetic, or synthetic origin which, by virtue of
its origin or manipulation: (1) is linked to a
polynucleotide other than that to which it is linked in
nature, or (2) does not occur in nature.

The term "polynucleotide" as used herein refers
to a polymeric form of nucleotides of any length, either
ribonucleotides or deoxyribonucleotides. This term
refers only to the primary structure of the molecule.
Thus, this term includes double- and single-stranded DNA,
and RNA. It also includes known types of modifications,
for example, labels which are known in the art,
methylation, "caps", substitution of one or more of the
naturally occurring nucleotides with an analog,
internucleotide modifications such as, for example, those
with uncharged linkages (e.g., methyl phosphonates,
phosphotriesters, phosphoamidates, carbamates, etc.) and
with charged linkages (e.g., phosphorothioates,
phosphorodithioates, etc.), those containing pendant
moieties, such as, for example, proteins (including,
e.g., nucleases, toxins, antibodies, signal peptides,
poly-L-lysine, etc.), those with intercalators (e.g.,
acridine, psoralen, etc.), those containing chelators
(e.g., metals, radioactive metals, boron, oxidative
metals, etc.), [...]
etc.), as well as [...]

[...] substantially
free of other components [...]
to 99% of the specified polypeptide.

[...] "host cells", "cells", [...]
"cell cultures", and other such
terms denote micro[...]
have been, used as rec[...] for a recombinant vector or
other transfer DNA, [...]
identical in morphology or
in genomic or total DNA [...]

[...]

A "cloning vector" is a replicon that can
transform a selected host cell and in which another
polynucleotide segment is attached, so as to bring about
the replication and/or expression of the attached
segment. Typically, cloning vectors include plasmids,
viruses (e.g., bacteriophage vectors) and cosmids.
An "integrating vector" is a vector that does
not behave as a replicon in a selected host cell, but has
the ability to integrate into a replicon (typically a
chromosome) resident in the selected host to stably
transform the host.

An "expression vector" is a construct that can
transform a selected host cell and provides for
expression of a heterologous coding sequence in the
selected host. Expression vectors can be either a
cloning vector or an integrating vector.

A "coding sequence" is a polynucleotide
sequence which is transcribed into mRNA and/or translated
into a polypeptide when placed under the control of
appropriate regulatory sequences. The boundaries of the
coding sequence are determined by a translation start
codon at the 5'-terminus and a translation stop codon at
the 3'-terminus. A coding sequence can include, but is
not limited to, mRNA, cDNA, and recombinant polynucleotide
sequences.
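
The boundary rule in this definition (a start codon at the 5'-terminus, an in-frame stop codon at the 3'-terminus) can be illustrated with a short scan of the three forward reading frames. This is a simplified sketch, not a gene-finding method: it assumes the standard genetic code (ATG start; TAA, TAG, TGA stops), ignores the complementary strand, and uses a hypothetical function name and example sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_sequences(seq, min_codons=1):
    """Return sorted (start, end) index pairs for stretches that begin with
    ATG and end with the first in-frame stop codon.

    `end` is exclusive and includes the stop codon. RNA input is accepted
    by treating U as T. `min_codons` counts codons before the stop.
    """
    dna = seq.upper().replace("U", "T")
    found = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # translation start codon
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    found.append((start, i + 3))
                start = None                   # look for the next start
    return sorted(found)


if __name__ == "__main__":
    example = "CCATGGCTTGATAAATGAAATAG"   # hypothetical sequence
    for start, end in find_coding_sequences(example):
        print(start, end, example[start:end])
```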

"Control sequence" refers to polynucleotide
regulatory sequences which are necessary to effect the
expression of coding sequences to which they are ligated.
The nature of such control sequences differs depending
upon the host organism. In prokaryotes, control
sequences generally include a promoter, a ribosomal binding
site, and terminators; in eukaryotes, control sequences
generally include promoters, terminators and, in some
instances, enhancers. The term "control sequences" is
intended to include, at a minimum, all components the
presence of which is necessary for expression, and may
also include additional advantageous components.
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5 ""f 01 /: 9 " 6 "" to a cooina seance 

5 " U ' atea ln » »y that expression of tL co^IT 

sequence is achieve,, under conditions compattote^'the 
control sequences. w 

An "open reading frame- or osp is a r „ lo „ of . 
poiynucleotide sequence which encodes a polypi? £ is 

rotal coding sequence. 

or M ~ "^^^ically cross-reactive" refers to two 
or .ore epitopes or polypeptides that are bound by the 
sa ffl e antibody. Cross-reactivity can be deterge! bTany 
15 of a number of immunoassay techniques, such as a 
competition assay. 

As used herein, the term "antibody" refers to a 
polypeptide or .roup of polypeptides which comprise at 

20 from J n V^?° Pe ' ^ Manti ^ n bindi »9 site" is formed 
from the folding of the variable domains of an antibody 

z iz^'i to f «~»«^ »^ S ite S :i 

an internal surface shape and charge distribution 
complementary to the features of an epitope of an 
antigen, which allows specific binding to form an 

fo»efr nti9en COBPleX * ^ anti ^ ^ding site may be 
formed from a heavy- and/or light-chain domain (VH anl 
VL respectively,, which form hypervariable loops which 
contribute to antigen binding. The term "antibody" 

30 alt T' With ° Ut limitation < «*i-«lo antibodies, 
30 altered antxbodies, univalent antibodies, Pab proteins 

and single-domain antibodies. Xn many cases, 'Le ^ng 
Phenomena of antibodies to antigens is equivalent to 
other lxgand/anti-iigand binding. 

35 (t3 in • ^ US6d hereln ' 3 " Single dooain antibody" 

(dAb) _„ an antibody which is comprised of an HL dLain, 
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which binds specifically with a designated antigen. A 
dAb does not contain a VL domain, but may contain other 
antigen binding domains known to exist to antibodies for 
example, the kappa and lambda domains. Methods for ' 
5 preparing dAbs are known in the art. See , for exafflple 
Ward et al, Nature Hi: 544 (1989) . 

Antibodies may also be comprised of VH and VL
domains, as well as other known antigen binding domains.
Examples of these types of antibodies and methods for
their preparation are known in the art (see, e.g., U.S.
Patent No. 4,816,467, which is incorporated herein by
reference), and include the following. For example,
"vertebrate antibodies" refers to antibodies which are
tetramers or aggregates thereof, comprising light and
heavy chains which are usually aggregated in a "Y"
configuration and which may or may not have covalent
linkages between the chains. In vertebrate antibodies,
the amino acid sequences of the chains are homologous
with those sequences found in antibodies produced in
vertebrates, whether in situ or in vitro (for example, in
hybridomas). Vertebrate antibodies include, for example,
purified polyclonal antibodies and monoclonal antibodies,
methods for the preparation of which are described infra.

"Hybrid antibodies" are antibodies where chains
are separately homologous with reference to mammalian
antibody chains and represent novel assemblies of them,
so that two different antigens are precipitable by the
tetramer or aggregate. In hybrid antibodies, one pair of
heavy and light chains are homologous to those found in
an antibody raised against a first antigen, while a
second pair of chains are homologous to those found in an
antibody raised against a second antigen. This results
in the property of "divalence", i.e., the ability to bind
two antigens simultaneously. Such hybrids may also be
formed using chimeric chains, as set forth below.

"Chimeric antibodies" refers to antibodies in
which the heavy and/or light chains are fusion proteins.
Typically, one portion of the amino acid sequences of the
chain is homologous to corresponding sequences in an
antibody derived from a particular species or a
particular class, while the remaining segment of the
chain is homologous to the sequences derived from another
species and/or class. Usually, the variable region of
both light and heavy chains mimics the variable regions
of antibodies derived from one species of vertebrates,
while the constant portions are homologous to the
sequences in the antibodies derived from another species
of vertebrates. However, the definition is not limited
to this particular example. Also included is any
antibody in which either or both of the heavy or light
chains are composed of combinations of sequences
mimicking the sequences in antibodies of different
sources, whether these sources be from differing classes
or different species of origin, and whether or not the
fusion point is at the variable/constant boundary. Thus,
it is possible to produce antibodies in which neither the
constant nor the variable region mimics known antibody
sequences. It then becomes possible, for example, to
construct antibodies whose variable region has a higher
specific affinity for a particular antigen, or whose
constant region can elicit enhanced complement fixation,
or to make other improvements in properties possessed by
a particular constant region.

Another example is "altered antibodies", which
refers to antibodies in which the naturally occurring
amino acid sequence in a vertebrate antibody has been
varied. Utilizing recombinant DNA techniques, antibodies
can be redesigned to obtain desired characteristics. The
possible variations are many, and range from the changing
of one or more amino acids to the complete redesign of a
region, for example, the constant region. Changes in the
constant region are, in general, made to attain desired
cellular process characteristics, e.g., changes in
complement fixation, interaction with membranes, and
other effector functions. Changes in the variable region
may be made to alter antigen binding characteristics. The
antibody may also be engineered to aid the specific
delivery of a molecule or substance to a specific cell or
tissue site. The desired alterations may be made by known
techniques in molecular biology, e.g., recombinant
techniques, site-directed mutagenesis, etc.

Yet another example is "univalent antibodies",
which are aggregates comprised of a
heavy-chain/light-chain dimer bound to the Fc (i.e.,
stem) region of a second heavy chain. This type of
antibody escapes antigenic modulation. See, e.g.,
Glennie et al. Nature 295:712 (1982). Included also
within the definition of antibodies are "Fab" fragments
of antibodies. The "Fab" region refers to those portions
of the heavy and light chains which are roughly
equivalent, or analogous, to the sequences which comprise
the branch portion of the heavy and light chains, and
which have been shown to exhibit immunological binding to
a specified antigen, but which lack the effector Fc
portion. "Fab" includes aggregates of one heavy and one
light chain (commonly known as Fab'), as well as
tetramers containing the 2H and 2L chains (referred to as
F(ab)2), which are capable of selectively reacting with a
designated antigen or antigen family. Fab antibodies may
be divided into subsets analogous to those described
above, i.e., "vertebrate Fab", "hybrid Fab", "chimeric
Fab", and "altered Fab". Methods of producing Fab
fragments of antibodies are known within the art and
include, for example, proteolysis, and synthesis by
recombinant techniques.

"Epitope" refers to an antibody binding site
usually defined by a polypeptide, but also by non-amino
acid haptens. An epitope could comprise 3 amino acids
in a spatial conformation which is unique to the epitope;
generally an epitope consists of at least 5 such amino
acids, and more usually, consists of at least 8-10 such
amino acids.

"Antigen-antibody complex" refers to the
complex formed by an antibody that is specifically bound
to an epitope on an antigen.

"Immunogenic polypeptide" refers to a 
polypeptide that elicits a cellular and/or humoral immune 
response in a mammal, whether alone or linked to a 
carrier, in the presence or absence of an adjuvant. 

"Polypeptide" refers to a polymer of amino
acids and does not refer to a specific length of the
molecule. Thus, peptides, oligopeptides, and proteins
are included within the definition of polypeptide. This
term also does not refer to or exclude post-expression
modifications of the polypeptide, for example,
glycosylations, acetylations, phosphorylations and the
like. Included within the definition are, for example,
polypeptides containing one or more analogs of an amino
acid (including, for example, unnatural amino acids,
etc.), polypeptides with substituted linkages, as well as
other modifications known in the art, both naturally
occurring and non-naturally occurring.

"Transformation", as used herein, refers to the 
insertion of an exogenous polynucleotide into a host 
cell, irrespective of the method used for the insertion,
for example, direct uptake, transduction, f-mating or
electroporation. The exogenous polynucleotide may be
maintained as a non-integrated vector, for example, a
plasmid, or alternatively, may be integrated into the
host genome.

A "transformed" host cell refers to both the
immediate cell that has undergone transformation and its
progeny that maintain the originally exogenous
polynucleotide.

"Treatment" as used herein refers to
prophylaxis and/or therapy.

"Individual" refers to vertebrates,
particularly members of the mammalian species, and
includes but is not limited to domestic animals, sports
animals, and primates, including humans.

"Sense strand" refers to the strand of a 
double-stranded DNA molecule that is homologous to an mRNA
transcript thereof. The "anti-sense strand" contains a
sequence which is complementary to that of the "sense
strand".
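
The relationship between the two strands can be
illustrated with a short Python sketch. The function
names are hypothetical, the input is assumed to be a
plain DNA string written 5' to 3', and nothing here is
taken from the HCV sequences themselves.

```python
# Hypothetical helpers for illustration only; the names are not
# taken from the specification.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_strand(sense: str) -> str:
    """Return the anti-sense strand (reverse complement), 5'->3'."""
    return sense.upper().translate(_COMPLEMENT)[::-1]


def transcript_of(sense: str) -> str:
    """Return the mRNA homologous to the sense strand (U replaces T)."""
    return sense.upper().replace("T", "U")
```

Applying antisense_strand twice returns the original
sense sequence, which is a convenient sanity check when
handling strand orientation.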

"Antibody-containing body component" refers to 
a component of an individual's body which is a source of 
the antibodies of interest. Antibody-containing body 
components are known in the art, and include but are not 
limited to, whole blood and components thereof, plasma,
serum, spinal fluid, lymph fluid, the external sections
of the respiratory, intestinal, and genitourinary tracts,
tears, saliva, milk, white blood cells, and myelomas.

"Purified HCV" isolate refers to a preparation 
of HCV particles which has been isolated from the
cellular constituents with which the virus is normally
associated, and from other types of viruses which may be
present in the infected tissue. The techniques for
isolating viruses are known to those of skill in the art,
and include, for example, centrifugation and affinity
chromatography.

An HCV "particle" is an entire virion, as well
as particles which are intermediates in virion formation.
HCV particles generally have one or more HCV proteins
associated with the HCV nucleic acid.

A "probe" refers to a polynucleotide which forms
a hybrid structure with a sequence in a target
polynucleotide, due to complementarity of at least one
region in the probe with a region in the target.

"Biological sample" refers to a sample of
tissue or fluid isolated from an individual, including
but not limited to, for example, whole blood and
components thereof, plasma, serum, spinal fluid, lymph
fluid, the external sections of the skin, respiratory,
intestinal, and genitourinary tracts, tears, saliva,
milk, blood cells, tumors, organs, and also samples of in
vitro cell culture constituents (including but not
limited to conditioned medium resulting from the growth
of cells in cell culture medium, putatively virally
infected cells, recombinant cells, and cell components).

The invention pertains to the isolation and
[...] believed to be characteristic of HCV isolates from
Japan and other Asian countries.

The nucleotide sequences derived from HCV/J1
and HCV/J7 are useful as probes to diagnose the presence
of virus in samples, and to isolate other naturally
occurring variants of the virus. These nucleotide
sequences also make available polypeptide sequences of
[...] and permit the production of polypeptides which are
useful as standards or reagents in diagnostic tests
and/or as components of vaccines. Antibodies, both
polyclonal and monoclonal, directed against HCV epitopes
contained within these polypeptide sequences are also
useful for diagnostic tests, as therapeutic agents, for
screening of antiviral agents, and for the isolation of
the NANBH virus. In addition, by utilizing probes derived
from the sequences disclosed herein it is possible to
isolate and sequence other portions of the J1 and J7
genome, thus giving rise to additional probes and
polypeptides which are useful in the diagnosis and/or
treatment, both prophylactic and therapeutic, of NANBH.

The availability of the HCV/J1 and HCV/J7
nucleotide sequences enables the construction of
polynucleotide probes and polypeptides useful in
diagnosing NANBH due to HCV infection and in screening
blood donors as well as donated blood and blood products
for infection. For example, from the sequences it is
possible to synthesize DNA oligomers of about 8-10
nucleotides, or larger, which are useful as hybridization
probes to detect the presence of HCV RNA in, for example,
sera of subjects suspected of harboring the virus, or for
screening donated blood for the presence of the virus.
The HCV/J1 and HCV/J7 sequences also allow the design and
production of HCV specific polypeptides which are useful
as diagnostic reagents for the presence of antibodies
raised during NANBH. Antibodies to purified polypeptides
derived from the HCV/J1 and HCV/J7 sequences may also be
used to detect viral antigens in infected individuals and
in blood.
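
The derivation of short oligomer probes from a known
sequence can be sketched in Python as follows. This is a
minimal illustration under stated assumptions: the
function names are hypothetical, the input is assumed to
be the sense-strand cDNA written 5' to 3', and exact
complementarity is used as a stand-in for hybridization,
which in practice also depends on stringency conditions.

```python
# Hypothetical helpers for illustration; not part of the specification.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def candidate_probes(sense_cdna: str, length: int = 10):
    """Yield (start, probe) for every window of `length` nucleotides.

    Each probe is the DNA oligomer complementary to the RNA region
    that corresponds to sense_cdna[start:start + length].
    """
    for start in range(len(sense_cdna) - length + 1):
        yield start, reverse_complement(sense_cdna[start:start + length])


def hybridizes(probe: str, target_rna: str) -> bool:
    """True if the probe is perfectly complementary to a target region.

    Assumption: a perfect match models hybridization; mismatches and
    stringency are ignored in this sketch.
    """
    site = reverse_complement(probe).replace("T", "U")
    return site in target_rna.upper()
```

In use, candidate_probes would be run over a sequenced
region and the resulting oligomers filtered further (for
example, for uniqueness against other sequences) before
synthesis; that filtering is outside this sketch.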

Knowledge of these HCV/J1 and HCV/J7 sequences
also enables the design and production of polypeptides
which may be used as vaccines against HCV and also for
the production of antibodies, which in turn may be used
for protection against the disease, and/or for therapy of
HCV infected individuals. Moreover, the disclosed HCV/J1
and HCV/J7 sequences enable further characterization of
the HCV genome. Polynucleotide probes derived from these
sequences, as well as from the HCV genome, may be used to
screen cDNA libraries for additional viral cDNA
sequences, which, in turn, may be used to obtain
additional overlapping sequences. See, e.g., EPO Pub.
No. 318,216.

The HCV/J1 and HCV/J7 polynucleotide sequences,
the polypeptides derived therefrom and the antibodies
directed against these polypeptides, are useful in the
isolation and identification of the BB-NANBV agent(s).
For example, antibodies directed against HCV epitopes
contained in polypeptides derived from the HCV/J1
sequences may be used in processes based upon affinity
chromatography to isolate the virus. Alternatively, the
antibodies may be used to identify viral particles
isolated by other techniques. The viral antigens and the
genomic material within the isolated viral particles may
then be further characterized.

The information obtained from further
sequencing of the HCV/J1 and HCV/J7 genomes, as well as
from further characterization of the HCV/J1 and HCV/J7
antigens and characterization of the genomes, enables the
design and synthesis of additional probes and
polypeptides and antibodies which may be used for
diagnosis, for prevention, and for therapy of HCV induced
NANBH, and for screening for infected blood and
blood-related products.

The availability of HCV/J1 and HCV/J7 cDNA
sequences permits the construction of expression vectors
encoding antigenically active regions of the polypeptide
encoded in either strand. These antigenically active
regions may be derived from coat or envelope antigens or
from core antigens, or from antigens which are
non-structural including, for example, polynucleotide
binding proteins, polynucleotide polymerase(s), and other
viral proteins required for the replication and/or
assembly of the virus particle. Fragments encoding the
desired polypeptides are derived from the cDNA clones
using conventional restriction digestion or by synthetic
methods, and are ligated into vectors which may, for
example, contain portions of fusion sequences such as
beta-galactosidase or superoxide dismutase (SOD). Methods
and vectors which are useful for the production of
polypeptides which contain fusion sequences of SOD are
described in EPO Pub. No. 196,056. Vectors encoding
fusion polypeptides of SOD and HCV polypeptides are
described in EPO Pub. No. 318,216. Any desired portion
of the HCV cDNA containing an open reading frame, in
either sense strand, can be obtained as a recombinant
polypeptide, such as a mature or fusion protein.
Alternatively, a polypeptide encoded in the cDNA can be
provided by chemical synthesis.

The DNA encoding the desired polypeptide,
whether in fused or mature form, and whether or not
containing a signal sequence to permit secretion, may be
ligated into expression vectors suitable for any
convenient host. Both eukaryotic and prokaryotic host
systems are presently used in forming recombinant
polypeptides, and a summary of some of the more common
control systems and host cells is given below. The
polypeptide produced in such host cells is then isolated
from lysed cells or from the culture medium and purified
to the extent needed for its intended use. Purification
may be by techniques known in the art, for example,
differential extraction, salt fractionation,
chromatography on ion exchange resins, affinity
chromatography, centrifugation, and the like. See, for
example, Methods in Enzymology for a variety of methods
for purifying proteins.

Such recombinant or synthetic HCV polypeptides
can be used as diagnostics, or those which give rise to
neutralizing antibodies may be formulated into vaccines.
Antibodies raised against these polypeptides can also be
used as diagnostics, or for passive immunotherapy. In
addition, antibodies to these polypeptides are useful for
isolating and identifying HCV particles.

The HCV antigens may also be isolated from HCV
virions. The virions may be grown in HCV infected cells
in tissue culture, or in an infected host.

While the polypeptides of the present invention
may comprise a substantially complete viral domain, in
many applications all that is required is that the
polypeptide comprise an antigenic or immunogenic region
of the virus. An antigenic region of a polypeptide is
generally relatively small, typically 8 to 10 amino acids
or less in length. Fragments of as few as 5 amino acids
may characterize an antigenic region. These segments may
correspond to regions of HCV/J1 or HCV/J7 epitopes.
Accordingly, using the cDNAs of HCV/J1 and HCV/J7 as a
basis, DNAs encoding short segments of HCV/J1 and HCV/J7
polypeptides can be expressed recombinantly either as
fusion proteins, or as isolated polypeptides. In
addition, short amino acid sequences can be conveniently
obtained by chemical synthesis.

In instances wherein the synthesized
polypeptide is correctly configured so as to provide the
correct epitope, but is too small to be immunogenic, the
polypeptide may be linked to a suitable carrier. A
number of techniques for obtaining such linkage are known
in the art, including the formation of disulfide linkages
using N-succinimidyl-3-(2-pyridyl-thio)propionate (SPDP)
and succinimidyl
4-(N-maleimido-methyl)cyclohexane-1-carboxylate (SMCC)
obtained from Pierce Company, Rockford, Illinois. (If the
peptide lacks a sulfhydryl group, this can be provided by
addition of a cysteine residue.) These reagents create a
disulfide linkage between themselves and peptide cysteine
residues on one protein and an amide linkage through the
epsilon-amino on a lysine, or other free amino group in
the other. A variety of such disulfide/amide-forming
agents are known. See, for example, Immun. Rev. (1982)
62:185. Other bifunctional coupling agents form a
thioether rather than a disulfide linkage. Many of these
thio-ether-forming agents are commercially available and
include reactive esters of 6-maleimidocaproic acid,
2-bromoacetic acid, 2-iodoacetic acid,
4-(N-maleimido-methyl)cyclohexane-1-carboxylic acid, and
the like. The carboxyl groups can be activated by
combining them with succinimide or
1-hydroxyl-2-nitro-4-sulfonic acid, sodium salt. An
additional method of coupling antigens employs the
rotavirus/"binding peptide" system described in EPO Pub.
No. 259,149, the disclosure of which is incorporated
herein by reference. The foregoing list is not meant to
be exhaustive, and modifications of the named compounds
can clearly be used.

Any carrier may be used which does not itself
induce the production of antibodies harmful to the host.
Suitable carriers are typically large, slowly metabolized
macromolecules such as proteins; polysaccharides, such as
latex functionalized sepharose, agarose, cellulose,
cellulose beads and the like; polymeric amino acids, such
as polyglutamic acid, polylysine, and the like; amino
acid copolymers; and inactive virus particles. Especially
useful protein substrates are serum albumins, keyhole
limpet hemocyanin, immunoglobulin molecules,
thyroglobulin, ovalbumin, tetanus toxoid, and other
proteins well known to those skilled in the art.

In addition to full-length viral proteins,
polypeptides comprising truncated HCV amino acid
sequences encoding at least one viral epitope are useful
immunological reagents. For example, polypeptides
comprising such truncated sequences can be used as
reagents in an immunoassay. These polypeptides also are
candidate subunit antigens in compositions for antiserum
production or vaccines. While these truncated sequences
can be produced by various known treatments of native
viral protein, it is generally preferred to make
synthetic or recombinant polypeptides comprising an HCV
sequence. Polypeptides comprising these truncated HCV
sequences can be made up entirely of HCV sequences (one
or more epitopes, either contiguous or noncontiguous), or
HCV sequences and heterologous sequences in a fusion
protein. Useful heterologous sequences include sequences
that provide for secretion from a recombinant host,
enhance the immunological reactivity of the HCV
epitopes, or facilitate the coupling of the polypeptide
to an immunoassay support or a vaccine carrier. See,
e.g., EPO Pub. No. 116,201; U.S. Pat. No. 4,722,840;
EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783, the
disclosures of which are incorporated herein by reference.

The size of polypeptides comprising the
truncated HCV sequences can vary widely, the minimum size
being a sequence of sufficient size to provide an HCV
epitope while the maximum size is not critical. In some
applications, the maximum size usually is not
substantially greater than that required to provide the
desired HCV epitopes and function(s) of the heterologous
sequence, if any. Typically, the truncated HCV amino
acid sequence will range from about 5 to about 100 amino
acids in length. More typically, however, the HCV
sequence will be a maximum of about 50 amino acids in
length, preferably a maximum of about 30 amino acids. It
is usually desirable to select HCV sequences of at least
about 10, 12 or 15 amino acids, up to a maximum of about
20 or 25 amino acids.

Truncated HCV amino acid sequences comprising
epitopes can be identified in a number of ways. For
example, the entire viral protein sequence can be
screened by preparing a series of short peptides that
together span the entire protein sequence. By starting
with, for example, 100-mer polypeptides, it would be
routine to test each polypeptide for the presence of
epitope(s) showing a desired reactivity, and then testing
progressively smaller and overlapping fragments from an
identified 100-mer to map the epitope of interest.
Screening such peptides in an immunoassay is within the
skill of the art. It is also known to carry out a
computer analysis of a protein sequence to identify
potential epitopes, and then prepare oligopeptides
comprising the identified regions for screening. It is
appreciated by those of skill in the art that such
computer analysis of antigenicity does not always
identify an epitope that actually exists, and can also
incorrectly identify a region of the protein as
containing an epitope.
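
The peptide-scanning approach described above can be
sketched in Python. This is an illustrative sketch only:
the function name, the window length and the step size
are assumptions chosen for the example, and the
immunoassay that decides which fragment is reactive is
not modeled.

```python
# Hypothetical helper for illustration; not part of the specification.
def overlapping_peptides(protein: str, length: int, step: int):
    """Return (start, peptide) tiles that together span `protein`.

    Consecutive tiles overlap by `length - step` residues. A final
    tile anchored at the C-terminus is added when the regular steps
    would otherwise leave the last residues uncovered.
    """
    if length <= 0 or step <= 0 or step > length:
        raise ValueError("require 0 < step <= length")
    if len(protein) <= length:
        return [(0, protein)]
    last = len(protein) - length
    starts = list(range(0, last + 1, step))
    if starts[-1] != last:
        starts.append(last)
    return [(s, protein[s:s + length]) for s in starts]
```

To narrow down an epitope, the same function could be
applied again to a fragment that tested positive, using a
smaller length and step at each round, for example from
100-mers to 20-mers to 8-mers.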

The observed relationship of the putative
polyproteins of HCV and the Flaviviruses allows a
prediction of the putative domains of the HCV
"non-structural" (NS) proteins. The locations of the
individual NS proteins in the putative Flavivirus
precursor polyprotein are fairly well-known. Moreover,
these also coincide with observed gross fluctuations in
the hydrophobicity profile of the polyprotein. It is
established that NS5 of Flaviviruses encodes the virion
polymerase, and that NS1 corresponds with a complement
fixation antigen which has been shown to be an effective
vaccine in animals. Recently, it has been shown that a
flaviviral protease function resides in NS3. Due to the
observed similarities between HCV and the Flaviviruses,
deductions concerning the approximate locations of the
corresponding protein domains and functions in the HCV
polyprotein are possible. Figure 11 is a schematic of
putative domains of the HCV polyprotein. The expression
of polypeptides containing these domains in a variety of
recombinant host cells, including, for example, bacteria,
yeast, insect, and vertebrate cells, should give rise to
important immunological reagents which can be used for
diagnosis, detection, and vaccines.
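
A hydrophobicity profile of the kind referred to above
can be computed with a sliding-window average. The sketch
below assumes the Kyte-Doolittle hydropathy scale and a
window of 7 residues; the text does not state which scale
or window was used, so both are assumptions, and the
function name is hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the specification
# does not name the scale used for its profiles).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 7):
    """Return the mean hydropathy of each window along the sequence.

    Raises KeyError for residues outside the 20 standard amino acids.
    The result has len(protein) - window + 1 values (empty if the
    sequence is shorter than the window).
    """
    scores = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]
```

Plotting the returned values against residue position for
two polyproteins gives profiles that can be compared by
eye; stretches with high positive values are hydrophobic,
and strongly negative stretches are hydrophilic or
charged.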

Although the non-structural protein region of
the putative polyproteins of the HCV isolate described
herein and of Flaviviruses appears to be generally
similar, there is less similarity between the putative
structural regions which are towards the N-terminus. In
this region, there is a greater divergence in sequence,
and in addition, the hydrophobic profile of the two
regions shows less similarity. This "divergence" begins
in the N-terminal region of the putative NS1 domain in
HCV, and extends to the presumed N-terminus.
Nevertheless, it is still possible to predict the
approximate locations of the putative nucleocapsid
(N-terminal basic domain) and E (generally hydrophobic)
domains within the HCV polyprotein. From these
predictions it may be possible to identify approximate
regions of the HCV polyprotein that could correspond with
useful immunological reagents. For example, the E and
NS1 proteins of Flaviviruses are known to have efficacy
as protective vaccines. These regions, as well as some
which are shown to be antigenic in the HCV1, for example
those within putative NS3, C, and NS5, etc., should also
provide diagnostic reagents.

The immunogenicity of the HCV sequences may
also be enhanced by preparing the sequences fused to or
assembled with particle-forming proteins such as, for
example, hepatitis B surface antigen or rotavirus VP6
antigen. Constructs wherein the HCV epitope is linked
directly to the particle-forming protein coding sequences
produce hybrids which are immunogenic with respect to the
HCV epitope. In addition, all of the vectors prepared
include epitopes specific to HBV, having various degrees
of immunogenicity, such as, for example, the pre-S
peptide. Thus, particles constructed from particle-forming
protein which include HCV sequences are immunogenic
with respect to HCV and the particle-forming protein.
See, e.g., U.S. Pat. No. 4,722,840; EPO Pub. No. 175,261;
EPO Pub. No. 259,149; Michelle et al. (1984) Int.
Symposium on Viral Hepatitis.

Vaccines may be prepared from one or more
immunogenic polypeptides derived from HCV/J1 or HCV/J7.
The observed homology between HCV and Flaviviruses
provides information concerning the polypeptides which
are likely to be most effective as vaccines, as well as
the regions of the genome in which they are encoded. The
general structure of the Flavivirus genome is discussed
in Rice et al. (1986) in THE VIRUSES: THE TOGAVIRIDAE AND
FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner, Vol.
eds. Schlesinger and Schlesinger, Plenum Press). The
flavivirus genomic RNA is believed to be the only
virus-specific mRNA species, and it is translated into
the three viral structural proteins, i.e., C, M, and E,
as well as two large nonstructural proteins, NV4 and NV5,
and a complex set of smaller nonstructural proteins. It
is known that major neutralizing epitopes for
Flaviviruses reside in the E (envelope) protein. Roehrig
(1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE
(Series eds. Fraenkel-Conrat and Wagner, Vol. eds.
Schlesinger and Schlesinger, Plenum Press). The
corresponding HCV E gene and polypeptide encoding region
may be predicted, based upon the homology to
Flaviviruses. Thus, vaccines may be comprised of
recombinant polypeptides containing epitopes of HCV E.
These polypeptides may be expressed in bacteria, yeast,
or mammalian cells, or alternatively may be isolated from
viral preparations. It is also anticipated that the
other structural proteins may also contain epitopes which
give rise to protective anti-HCV antibodies. Thus,
polypeptides containing the epitopes of E, C, and M may
also be used, whether singly or in combination, in HCV
vaccines.

In addition to the above, it has been shown
that immunization with NS1 (nonstructural protein 1)
results in protection against yellow fever. Schlesinger
et al. (1986) J. Virol. 60:1153. This is true even though
the immunization does not give rise to neutralizing
antibodies. Thus, particularly since this protein
appears to be highly conserved among Flaviviruses, it is
likely that HCV NS1 will also be protective against HCV
infection. Moreover, it also shows that nonstructural
proteins may provide protection against viral
pathogenicity, even if they do not cause the production
of neutralizing antibodies.

In view of the above, multivalent vaccines
against HCV may be comprised of one or more epitopes from
one or more structural proteins, and/or one or more
epitopes from one or more nonstructural proteins. These
vaccines may be comprised of, for example, recombinant
HCV polypeptides and/or polypeptides isolated from the
virions. In particular, vaccines are contemplated
comprising one or more of the following HCV proteins, or
subunit antigens derived therefrom: E, NS1, C, NS2, NS3,
NS4 and NS5. Particularly preferred are vaccines
comprising E and/or NS1, or subunits thereof. In
addition, it may be possible to use inactivated HCV in
vaccines; inactivation may be by the preparation of viral
lysates, or by other means known in the art to cause
inactivation of Flaviviruses, for example, treatment with
organic solvents or detergents, or treatment with
formalin. Moreover, vaccines may also be prepared from
attenuated HCV strains or from hybrid viruses such as
vaccinia vectors known in the art [Brown et al. Nature
319:549-550 (1986)].

The preparation of vaccines which contain
immunogenic polypeptide(s) as active ingredients is known
to one skilled in the art. Typically, such vaccines are
prepared as injectables, either as liquid solutions or
suspensions; solid forms suitable for solution in, or
suspension in, liquid prior to injection may also be
prepared. The preparation may also be emulsified, or the
protein encapsulated in liposomes. The active
immunogenic ingredients are often mixed with excipients
which are pharmaceutically acceptable and compatible with
the active ingredient. Suitable excipients are, for
example, water, saline, dextrose, glycerol, ethanol, or
the like and combinations thereof. In addition, if
desired, the vaccine may contain minor amounts of
auxiliary substances such as wetting or emulsifying
agents, pH buffering agents, and/or adjuvants which
enhance the effectiveness of the vaccine. Examples of
adjuvants which may be effective include but are not
limited to: aluminum hydroxide,
N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP),
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637,
referred to as nor-MDP),
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycerol-3-hydroxyphosphoryloxy)-ethylamine
(CGP 19835A, referred to as MTP-PE), and RIBI, which
contains three components extracted from bacteria,
monophosphoryl lipid A, trehalose dimycolate and cell
wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80
emulsion. The effectiveness of an adjuvant may be
determined by measuring the amount of antibodies directed
against an immunogenic polypeptide containing an HCV
antigenic sequence resulting from administration of this
polypeptide in vaccines which are also comprised of the
various adjuvants.

The vaccines are conventionally administered
parenterally, by injection, usually either
subcutaneously or intramuscularly. Additional
formulations which are suitable for other modes of
administration include suppositories and, in some cases,
oral formulations. For suppositories, traditional binders
and carriers may include, for example, polyalkylene
glycols or triglycerides; such suppositories may be
formed from mixtures containing the active ingredient in
the range of 0.5% to 10%, preferably 1%-2%. Oral
formulations include such normally employed excipients
as, for example, pharmaceutical grades of mannitol,
lactose, starch, magnesium stearate, sodium saccharine,
cellulose, magnesium carbonate, and the like. These
compositions take the form of solutions, suspensions,
tablets, pills, capsules, sustained release formulations
or powders and contain 10%-95% of active ingredient,
preferably 25%-70%.

The proteins may be formulated into the vaccine
as neutral or salt forms. Pharmaceutically acceptable
salts include the acid addition salts (formed with free
amino groups of the peptide), which are formed with
inorganic acids such as, for example, hydrochloric or
phosphoric acids, or such organic acids as acetic,
oxalic, tartaric, maleic, and the like. Salts formed
with the free carboxyl groups may also be derived from
inorganic bases such as, for example, sodium, potassium,
ammonium, calcium, or ferric hydroxides, and such organic
bases as isopropylamine, trimethylamine, 2-ethylamino
ethanol, histidine, procaine, and the like.

The vaccines are administered in a manner
compatible with the dosage formulation, and in such
amount as will be prophylactically and/or therapeutically
effective. The quantity to be administered, which is
generally in the range of 5 micrograms to 250 micrograms
of antigen per dose, depends on the subject to be
treated, capacity of the subject's immune system to
synthesize antibodies, and the degree of protection
desired. Precise amounts of active ingredient required
to be administered may depend on the judgment of the
practitioner and may be peculiar to each subject.

The vaccine may be given in a single dose
schedule, or preferably in a multiple dose schedule. A
multiple dose schedule is one in which a primary course
of vaccination may be with 1-10 separate doses, followed
by other doses given at subsequent time intervals
required to maintain and/or reinforce the immune
response, for example, at 1-4 months for a second dose,
and if needed, a subsequent dose(s) after several months.
The dosage regimen will also, at least in part, be
determined by the need of the individual and be dependent
upon the judgment of the practitioner.

In addition, the vaccine containing the
immunogenic HCV antigen(s) may be administered in
conjunction with other immunoregulatory agents, for
example, immune globulins.

The immunogenic polypeptides prepared as
described above are used to produce antibodies, both
polyclonal and monoclonal. If polyclonal antibodies are
desired, a selected mammal (e.g., mouse, rabbit, goat,
horse, etc.) is immunized with an immunogenic polypeptide
bearing an HCV epitope(s). Serum from the immunized
animal is collected and treated according to known
procedures. If serum containing polyclonal antibodies to
an HCV epitope contains antibodies to other antigens, the
polyclonal antibodies can be purified by immunoaffinity
chromatography. Techniques for producing and processing
polyclonal antisera are known in the art, see for
example, Mayer and Walker, eds. (1987) IMMUNOCHEMICAL
METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press,
London).

Monoclonal antibodies directed against HCV
epitopes can also be readily produced by one skilled in
the art. The general methodology for making monoclonal
antibodies by hybridomas is well known. Immortal
antibody-producing cell lines can be created by cell
fusion, and also by other techniques such as direct
transformation of B lymphocytes with oncogenic DNA, or
transfection with Epstein-Barr virus. See, e.g.,
Schreier et al. (1980) HYBRIDOMA TECHNIQUES; Hammerling
et al. (1981), MONOCLONAL ANTIBODIES AND T-CELL
HYBRIDOMAS. See also, U.S. Patent Nos. 4,341,761; 4,399,121;
4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632;
and 4,493,890. Panels of monoclonal antibodies produced
against HCV epitopes can be screened for various
properties; i.e., for isotype, epitope affinity, etc.

Antibodies, both monoclonal and polyclonal,
which are directed against HCV epitopes are particularly
useful in diagnosis, and those which are neutralizing are
useful in passive immunotherapy. Monoclonal antibodies,
in particular, may be used to raise anti-idiotype
antibodies.

Anti-idiotype antibodies are immunoglobulins
which carry an "internal image" of the antigen of the
infectious agent against which protection is desired.
Techniques for raising anti-idiotype antibodies are known
in the art. See, e.g., Grzych (1985), Nature 316:74;
MacNamara et al. (1984), Science 226:1325; Uytdehaag et
al. (1985), J. Immunol. 134:1225. These anti-idiotype
antibodies may also be useful for treatment and/or
diagnosis of NANBH, as well as for an elucidation of the
immunogenic regions of HCV antigens.

Using the HCV/J1 or HCV/J7 polynucleotide
sequences as a basis, oligomers of approximately 8
nucleotides or more can be prepared, either by excision
or synthetically, which hybridize with the HCV genome and
are useful in identification of the viral agent(s),
further characterization of the viral genome(s), as well
as in detection of the virus(es) in diseased individuals.
The probes for HCV polynucleotides (natural or derived)
are a length which allows the detection of unique viral
sequences by hybridization. While 6-8 nucleotides may be
a workable length, sequences of about 10-12 nucleotides
are preferred, and about 20 nucleotides appears optimal.
These probes can be prepared using routine methods,
including automated oligonucleotide synthetic methods.
Among useful probes, for example, are the clones
disclosed herein, as well as the various oligomers useful
in probing cDNA libraries, set forth below. A complement
to any unique portion of the HCV genome will be
satisfactory. For use as probes, complete complementarity
is desirable, though it may be unnecessary as the length
of the fragment is increased.
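
As a rough illustration of why longer probes detect unique
sequences more reliably, the number of places at which one
particular sequence of k nucleotides is expected to occur by
chance in random sequence falls fourfold with each added
nucleotide. The sketch below assumes equal base frequencies
and independent positions, which real genomes do not satisfy.
The function name and the 10,000-nucleotide target length are
illustrative assumptions and are not taken from this
specification.

```python
def expected_chance_matches(probe_length: int, target_length: int) -> float:
    """Expected number of exact chance occurrences of one specific probe
    sequence on a single strand of random sequence (equal base frequencies)."""
    if probe_length < 1 or target_length < probe_length:
        return 0.0
    positions = target_length - probe_length + 1  # possible start sites
    return positions / 4 ** probe_length


if __name__ == "__main__":
    target = 10_000  # hypothetical target length in nucleotides (assumption)
    for k in (6, 8, 12, 20):
        print(k, expected_chance_matches(k, target))
```

Under these simplifying assumptions a 6-nucleotide sequence is
expected to recur by chance a few times in such a target,
whereas a 20-nucleotide sequence is not.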

For use of such probes as diagnostics, the
biological sample to be analyzed, such as blood or serum,
may be treated, if desired, to extract the nucleic acids
contained therein. The resulting nucleic acid from the
sample may be subjected to gel electrophoresis or other
size separation techniques; alternatively, the nucleic
acid sample may be dot blotted without size separation.
The probes are then labeled. Suitable labels, and
methods for labeling probes are known in the art, and
include, for example, radioactive labels incorporated by
nick translation or kinasing, biotin, fluorescent probes,
and chemiluminescent probes. The nucleic acids extracted
from the sample are then treated with the labeled probe
under hybridization conditions of suitable stringencies.
Usually high stringency conditions are desirable in order
to prevent false positives. The stringency of
hybridization is determined by a number of factors
during hybridization and during the washing procedure,
including temperature, ionic strength, length of time,
and concentration of formamide. These factors are
outlined in, for example, Maniatis, T. (1982) MOLECULAR
CLONING; A LABORATORY MANUAL (Cold Spring Harbor Press,
Cold Spring Harbor, N.Y.).
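
The way temperature, ionic strength and formamide interact
can be illustrated with one commonly quoted empirical
approximation for the melting temperature of long DNA:DNA
hybrids. This formula is not given in this specification; it
is shown only as a sketch, its coefficients vary somewhat
between sources, and it is not suitable for short
oligonucleotides. The function and parameter names are
assumptions introduced for illustration.

```python
import math


def approx_tm_celsius(length_nt: int, gc_percent: float,
                      na_molar: float, formamide_percent: float = 0.0) -> float:
    """Approximate melting temperature (deg C) of a long DNA:DNA hybrid.

    Empirical relation (an assumption, not from the source text):
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/length
    """
    if length_nt <= 0 or na_molar <= 0:
        raise ValueError("length and sodium concentration must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_nt)


if __name__ == "__main__":
    # Hypothetical 500-nucleotide probe, 50% GC, 1 M Na+, with and without formamide.
    print(approx_tm_celsius(500, 50.0, 1.0))
    print(approx_tm_celsius(500, 50.0, 1.0, formamide_percent=50.0))
```

In this approximation, hybridizing or washing closer to the
estimated melting temperature corresponds to higher
stringency, and adding formamide lowers the temperature at
which a given stringency is reached.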

Generally, it is expected that the HCV genome
sequences will be present in serum of infected
individuals at relatively low levels, i.e., at
approximately 10^2-10^3 chimp infectious doses (CID) per ml.
This level may require that amplification techniques be
used in hybridization assays. Such techniques are known
in the art. For example, the Enzo Biochemical
Corporation "Bio-Bridge" system uses terminal
deoxynucleotide transferase to add unmodified
3'-poly-dT-tails to a DNA probe. The poly dT-tailed
probe is hybridized to the target nucleotide sequence,
and then to a biotin-modified poly-A. PCT App. No.
84/03520 and EPO Pub. No. 124,221 describe a DNA
hybridization assay in which: (1) analyte is annealed to
a single-stranded DNA probe that is complementary to an
enzyme-labeled oligonucleotide; and (2) the resulting
tailed duplex is hybridized to an enzyme-labeled
oligonucleotide. EPO Pub. No. 204,510 describes a DNA
hybridization assay in which analyte DNA is contacted
with a probe that has a tail, such as a poly-dT tail, an
amplifier strand that has a sequence that hybridizes to
the tail of the probe, such as a poly-A sequence, and
which is capable of binding a plurality of labeled
strands.

A particularly desirable technique may first
involve amplification of the target HCV sequences in sera
approximately 10,000-fold, i.e., to approximately 10^6
sequences/ml. This may be accomplished, for example, by
the polymerase chain reaction (PCR) technique
described by Saiki et al. (1986) Nature 324:163, Mullis,
U.S. Patent No. 4,683,195, and Mullis et al. U.S. Patent
No. 4,683,202. The amplified sequence(s) may then be
detected using a hybridization assay which is described
in co-pending European Publication No. 317,077 and
Japanese application No. 63-260347, which are assigned to
the herein assignee, and are hereby incorporated herein
by reference. These hybridization assays, which should
detect sequences at the level of 10^6/ml, utilize nucleic
acid multimers which bind to single-stranded analyte
nucleic acid, and which also bind to a multiplicity of
single-stranded labeled oligonucleotides. A suitable
solution phase sandwich assay which may be used with
labeled polynucleotide probes, and the methods for the
preparation of probes, are described in EPO Pub. No.
225,807, which is hereby incorporated herein by
reference.
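
The arithmetic behind a 10,000-fold amplification can be
sketched as follows. The calculation assumes an idealized
reaction in which the target increases by a constant factor
of (1 + efficiency) in every cycle; real reactions fall short
of perfect doubling and eventually plateau. The function name
and the efficiency parameter are illustrative assumptions.

```python
import math


def min_cycles_for_fold(fold: float, efficiency: float = 1.0) -> int:
    """Smallest whole number of cycles n with (1 + efficiency)**n >= fold."""
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in the interval (0, 1]")
    if fold <= 1:
        return 0
    growth = 1.0 + efficiency
    n = math.ceil(math.log(fold) / math.log(growth))
    # Correct for floating-point rounding in the logarithm.
    while growth ** n < fold:
        n += 1
    while n > 0 and growth ** (n - 1) >= fold:
        n -= 1
    return n


if __name__ == "__main__":
    print(min_cycles_for_fold(10_000))        # perfect doubling each cycle
    print(min_cycles_for_fold(10_000, 0.8))   # hypothetical 80% efficiency
```

With perfect doubling, 13 cycles give 8,192-fold and 14 cycles
give 16,384-fold amplification, so at least 14 cycles would be
needed to reach 10,000-fold under this idealization.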

The probes can be packaged into diagnostic
kits. Diagnostic kits include the probe DNA, which may
be labeled; alternatively, the probe DNA may be unlabeled
and the ingredients for labeling may be included in the
kit in separate containers. The kit may also contain
other suitably packaged reagents and materials needed for
the particular hybridization protocol, for example,
standards, wash buffers, as well as instructions for
conducting the test.

Both the HCV/J1 or HCV/J7 polypeptides which
react immunologically with serum containing HCV
antibodies and the antibodies raised against the HCV
specific epitopes in these polypeptides are useful in
immunoassays to detect presence of HCV antibodies, or the
presence of the virus and/or viral antigens, in
biological samples. Design of the immunoassays is subject
to a great deal of variation, and a variety of these are
known in the art. An immunoassay for anti-HCV antibody
may utilize one viral epitope or several viral epitopes.
When multiple epitopes are used, the epitopes may be
derived from the same or different viral polypeptides,
and may be in separate recombinant or natural
polypeptides, or together in the same recombinant
polypeptides.

An immunoassay for viral antigen may use, for
example, a monoclonal antibody directed towards a viral
epitope, a combination of monoclonal antibodies directed
towards epitopes of one viral polypeptide, monoclonal
antibodies directed towards epitopes of different viral
polypeptides, polyclonal antibodies directed towards the
same viral antigen, polyclonal antibodies directed
towards different viral antigens or a combination of
monoclonal and polyclonal antibodies.

Immunoassay protocols may be based, for
example, upon competition, or direct reaction, or
sandwich type assays. Protocols may also, for example,
use solid supports, or may be by immunoprecipitation.
Most assays involve the use of labeled antibody or
polypeptide. The labels may be, for example,
fluorescent, chemiluminescent, radioactive, or dye
molecules. Assays which amplify the signals from the
probe are also known. Examples are assays
which utilize biotin and avidin, and enzyme-labeled and
mediated immunoassays, such as ELISA assays.

Typically, an immunoassay for anti-HCV antibody
will involve selecting and preparing the test sample,
such as a biological sample, and then incubating it with
an antigenic (i.e., epitope-containing) HCV polypeptide
under conditions that allow antigen-antibody complexes to
form. Such conditions are well known in the art. In a
heterogeneous format, the polypeptide is bound to a solid
support to facilitate separation of the sample from the
polypeptide after incubation. Examples of solid supports
that can be used are nitrocellulose, in membrane or
microtiter well form, polyvinylchloride, in sheets or
microtiter wells, polystyrene latex, in beads or
microtiter plates, polyvinylidene fluoride, known as
Immobilon™, diazotized paper, nylon membranes, activated
beads, and Protein A beads. Most preferably, the
Dynatech Immulon™ 1 microtiter plate or the 0.25-inch
polystyrene beads, which are Spec finished by Precision
Plastic Ball, are used in the heterogeneous format. The
solid support is typically washed after separating it
from the test sample. In a homogeneous format, the test
sample is incubated with antigen in solution, under
conditions that will precipitate any antigen-antibody
complexes that are formed, as is known in the art. The
precipitated complexes are then separated from the test
sample, for example, by centrifugation. The complexes
formed comprising anti-HCV antibody are then detected by
any of a number of techniques. Depending on the format,
the complexes can be detected with labeled
anti-xenogeneic Ig or, if a competitive format is used, by
measuring the amount of bound, labeled competing
antibody.

In immunoassays where HCV polypeptides are the
analyte, the test sample, typically a biological sample,
is incubated with anti-HCV antibodies, again under
conditions that allow the formation of antigen-antibody
complexes. Various formats can be employed, such as a
"sandwich" assay where antibody bound to a solid support
is incubated with the test sample; washed; incubated with
a second, labeled antibody to the analyte; and the
support is washed again. Analyte is detected by
determining if the second antibody is bound to the
support. In a competitive format, which can be either
heterogeneous or homogeneous, a test sample is usually
incubated with an antibody and a labeled, competing
antigen either sequentially or simultaneously. These and
other formats are well known in the art.

The Flavivirus model for HCV allows predictions
regarding the likely location of diagnostic epitopes for
the virion structural proteins. The C, pre-M, M, and E
domains are all likely to contain epitopes of significant
potential for detecting viral antigens, and particularly
for diagnosis. Similarly, domains of the nonstructural
proteins are expected to contain important diagnostic
epitopes (e.g., NS5 encoding a putative polymerase; and
NS1 encoding a putative complement-binding antigen).
Recombinant polypeptides, or viral polypeptides, which
include epitopes from these specific domains may be
useful for the detection of viral antibodies in
infectious blood donors and infected patients. In
addition, antibodies directed against the E and/or M
proteins can be used in immunoassays for the detection of
viral antigens in patients with HCV-caused NANBH, and in
infectious blood donors. Moreover, these antibodies may
be extremely useful in detecting acute-phase donors and
patients.

Antigenic regions of the putative polyprotein
can be mapped and identified by screening the
antigenicity of bacterial expression products of HCV
cDNAs which encode portions of the polyprotein. Other
antigenic regions of HCV may be detected by expressing
the portions of the HCV cDNAs in other expression
systems, including yeast systems and cellular systems
derived from insects and vertebrates. In addition,
studies giving rise to an antigenicity index and
hydrophobicity/hydrophilicity profile yield
information concerning the probability of a region's
antigenicity. Efficient detection systems may include
the use of panels of epitopes. The epitopes in the panel
may be constructed into one or multiple polypeptides.
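
A hydrophobicity/hydrophilicity profile of the kind mentioned
above can be sketched as a sliding-window average over a
per-residue scale. This specification does not name a
particular scale or window; the sketch below assumes the
widely used Kyte-Doolittle hydropathy values and a 7-residue
window, and the example peptide is arbitrary. On this scale
low (negative) averages mark hydrophilic stretches, which are
more likely to be surface exposed; that is only a heuristic
for antigenicity.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 7) -> list:
    """Mean hydropathy of each window of `window` consecutive residues.

    Raises KeyError if the sequence contains a non-standard residue code.
    """
    seq = sequence.upper()
    if window < 1 or len(seq) < window:
        return []
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    return [sum(scores[i:i + window]) / window
            for i in range(len(seq) - window + 1)]


if __name__ == "__main__":
    peptide = "MKRDELIVLAGSTNPQ"  # arbitrary example peptide (assumption)
    for start, value in enumerate(hydropathy_profile(peptide), start=1):
        print(start, round(value, 2))
```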

Kits suitable for immunodiagnosis and
containing the appropriate labeled reagents are constructed
by packaging the appropriate materials, including the
polypeptides of the invention containing HCV epitopes or
antibodies directed against HCV epitopes in suitable
containers, along with the remaining reagents and
materials required for the conduct of the assay (e.g., wash
buffers, detection means like labeled anti-human Ig,
labeled anti-HCV, or labeled HCV antigen), as well as a
suitable set of assay instructions.

The HCV/J1 and HCV/J7 nucleotide sequence
information described herein may be used to gain further
information on the sequence of the HCV genomes, and for
identification and isolation of additional HCV isolates
related to J1 or J7. This information, in turn, can lead
to additional polynucleotide probes, polypeptides derived
from the HCV genome, and antibodies directed against HCV
epitopes which would be useful for the diagnosis and/or
treatment of HCV-caused NANBH.

The HCV/J1 and HCV/J7 nucleotide sequence
information herein is useful for the design of probes for
the isolation of additional sequences which are derived
from as yet undefined regions of the HCV genomes from
which the J1 and J7 sequences are derived. For example,
labeled probes containing a sequence of approximately 8
or more nucleotides, and preferably 20 or more
nucleotides, which are derived from regions close to the
5'-termini or 3'-termini of the family of HCV cDNA
sequences disclosed in the examples may be used to
isolate overlapping cDNA sequences from HCV cDNA
libraries. These sequences, which overlap the cDNAs in
the above-mentioned clones, but which also contain
sequences derived from regions of the genome from which
the cDNA in the above-mentioned clones are not derived,
may then be used to synthesize probes for identification
of other overlapping fragments which do not necessarily
overlap the cDNAs described below. Methods for
constructing cDNA libraries are known in the art. See,
e.g., EPO Pub. No. 318,216. It is particularly preferred
to prepare libraries from the serum of Japanese and other
Asian patients diagnosed as having NANBH demonstrating
antibody to HCV1 antigens; these are believed to be the
most likely candidates for carriers of HCV/J1, HCV/J7, or
related isolates.

HCV particles may be isolated from the sera
from individuals with NANBH or from cell cultures by any
of the methods known in the art, including for example,
techniques based on size discrimination such as
sedimentation or exclusion methods, or techniques based
on density such as ultracentrifugation in density
gradients, or precipitation with agents such as
polyethylene glycol, or chromatography on a variety of
materials such as anionic or cationic exchange materials,
and materials which bind due to hydrophobicity.

A preferred method of isolating HCV particles
or antigen is by immunoaffinity columns. Techniques for
immunoaffinity chromatography are known in the art,
including techniques for affixing antibodies to solid
supports so that they retain their immunoselective
activity. The techniques may be those in which the
antibodies are adsorbed to the support (see, for example,
Kurstak in ENZYME IMMUNODIAGNOSIS, page 31-37), as well
as those in which the antibodies are covalently linked to
the support. Generally, the techniques are similar to
those used for covalent linking of antigens to a solid
support, described above. However, spacer groups may be
included in the bifunctional coupling agents so that the
antigen binding site of the antibody remains accessible.
The antibodies may be monoclonal, or polyclonal, and it
may be desirable to purify the antibodies before their
use in the immunoassay.

The general techniques used in extracting the
genome from a virus, preparing and probing a cDNA
library, sequencing clones, constructing expression
vectors, transforming cells, performing immunological
assays such as radioimmunoassays and ELISA assays,
growing cells in culture, and the like are known in the
art and laboratory manuals are available describing these
techniques. However, as a general guide, the following
sets forth some sources currently available for such
procedures, and for materials useful in carrying them
out.

Both prokaryotic and eukaryotic host cells may
be used for expression of desired coding sequences when
appropriate control sequences which are compatible with
the designated host are used. Among prokaryotic hosts,
E. coli is most frequently used. Expression control
sequences for prokaryotes include promoters, optionally
containing operator portions, and ribosome binding sites.
Transfer vectors compatible with prokaryotic hosts are
commonly derived from, for example, pBR322, a plasmid
containing operons conferring ampicillin and tetracycline
resistance, and the various pUC vectors, which also
contain sequences conferring antibiotic resistance
markers. These markers may be used to obtain successful
transformants by selection. Commonly used prokaryotic
control sequences include the Beta-lactamase
(penicillinase) and lactose promoter systems (Chang et
al. (1977), Nature 198:1056), the tryptophan (trp) promoter
system (Goeddel et al. (1980) Nucleic Acids Res. 8:4057),
and the lambda-derived P L promoter and N gene ribosome
binding site (Shimatake et al. (1981) Nature 292:128) and
the hybrid tac promoter (De Boer et al. (1983) Proc.
Natl. Acad. Sci. USA 291:128) derived from sequences of
the trp and lac UV5 promoters. The foregoing systems are
particularly compatible with E. coli; if desired, other
prokaryotic hosts such as strains of Bacillus or
Pseudomonas may be used, with corresponding control
sequences.

Eukaryotic hosts include yeast and mammalian
cells in culture systems. Saccharomyces cerevisiae,
Saccharomyces carlsbergensis, [illegible] and
[illegible] are the most commonly used yeast hosts, and are
convenient fungal hosts. Yeast compatible vectors carry
markers which permit selection of successful
transformants by conferring prototrophy to auxotrophic
mutants or resistance to heavy metals on wild-type
strains. Yeast compatible vectors may employ the 2
micron origin of replication (Broach et al. (1983) Meth.
Enz. 101:307), the combination of CEN3 and ARS1 or other
means for assuring replication, such as sequences which
will result in incorporation of an appropriate fragment
into the host cell genome. Control sequences for yeast
vectors are known in the art and include promoters for
the synthesis of glycolytic enzymes (Hess et al. (1968)
J. Adv. Enzyme Eng. 7:149; Holland et al. (1978) J.
Biol. Chem. 256:1385), including the promoter for
3-phosphoglycerate kinase (Hitzeman (1980), J. Biol. Chem.
255:2073). Terminators may also be included, such as
those derived from the enolase gene (Holland (1981), J.
Biol. Chem. 256:1385). Particularly useful control
systems are those which comprise the
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter or
alcohol dehydrogenase (ADH) regulable promoter, terminators
also derived from GAPDH, and, if secretion is desired, a
leader sequence from yeast alpha factor. In addition,
the transcriptional regulatory region and the
transcriptional initiation region which are operably
linked may be such that they are not naturally associated
in the wild-type organism. These systems are described
in detail in EPO Pub. No. 120,551; EPO Pub. No. 116,201;
and EPO Pub. No. 164,556, all of which are incorporated
herein by reference.

Mammalian cell lines available as hosts for
expression are known in the art and include many
immortalized cell lines available from the American Type
Culture Collection (ATCC), including HeLa cells, Chinese
hamster ovary (CHO) cells, baby hamster kidney (BHK)
cells, and a number of other cell lines. Suitable
promoters for mammalian cells are also known in the art
and include viral promoters such as that from Simian
Virus 40 (SV40) (Fiers (1978), Nature 273:113), Rous
sarcoma virus (RSV), adenovirus (ADV), and bovine
papilloma virus (BPV). Mammalian cells may also require
terminator sequences and poly A addition sequences;
enhancer sequences which increase expression may also be
included, and sequences which cause amplification of the
gene may also be desirable. These sequences are known in
the art. Vectors suitable for replication in mammalian
cells may include viral replicons, or sequences which
ensure integration of the appropriate sequences encoding
NANBV epitopes into the host genome.

The vaccinia virus system can also be used to
express foreign DNA in mammalian cells. To express
heterologous genes, the foreign DNA is usually inserted
into the thymidine kinase gene of the vaccinia virus and
then infected cells can be selected. This procedure is
known in the art and further information can be found in
these references [Mackett et al. J. Virol. 49:857-864
(1984) and Chapter 7 in DNA Cloning, Vol. 2, IRL Press].

In addition, viral antigens can be expressed in
insect cells by the Baculovirus system. A general guide
to baculovirus expression by Summers and Smith is A Manual
of Methods for Baculovirus Vectors and Insect Cell
Culture Procedures (Texas Agricultural Experiment Station
Bulletin No. 1555). To incorporate the heterologous gene
into the Baculovirus genome the gene is first cloned into
a transfer vector containing some Baculovirus sequences.
This transfer vector, when it is cotransfected with
wild-type virus into insect cells, will recombine with the
wild-type virus. Usually, the transfer vector will be
engineered so that the heterologous gene will disrupt the
wild-type Baculovirus polyhedrin gene. This disruption
enables easy selection of the recombinant virus since the
cells infected with the recombinant virus will appear
phenotypically different from the cells infected with the
wild-type virus. The purified recombinant virus can be
used to infect cells to express the heterologous gene.
The foreign protein can be secreted into the medium if a
signal peptide is linked in frame to the heterologous
gene; otherwise, the protein will be bound in the cell
lysates. For further information, see Smith et al. Mol. &
Cell. Biol. 3:2156-2165 (1983) or Luckow and Summers in
Virology 170:31-39 (1989).

Transformation may be by any known method for
introducing polynucleotides into a host cell, including,
for example, packaging the polynucleotide in a virus and
transducing a host cell with the virus, and by direct
uptake of the polynucleotide. The transformation
procedure used depends upon the host to be transformed.
Bacterial transformation by direct uptake generally
employs treatment with calcium or rubidium chloride
(Cohen (1972), Proc. Natl. Acad. Sci. USA 69:2110;
Maniatis et al. (1982), MOLECULAR CLONING; A LABORATORY
MANUAL (Cold Spring Harbor Press, Cold Spring Harbor,
N.Y.)). Yeast transformation by direct uptake may be
carried out using the method of Hinnen et al. (1978)
Proc. Natl. Acad. Sci. USA 75:1929. Mammalian
transformations by direct uptake may be conducted using
the calcium phosphate precipitation method of Graham and
Van der Eb (1978), Virology 52:546 or the various known
modifications thereof.

Vector construction employs techniques which
are known in the art. Site-specific DNA cleavage is
performed by treating with suitable restriction enzymes
under conditions which generally are specified by the
manufacturer of these commercially available enzymes.
The cleaved fragments may be separated using
polyacrylamide or agarose gel electrophoresis techniques,
according to the general procedures found in Methods in
Enzymology (1980) 65:499-560. Sticky ended cleavage
fragments may be blunt ended using E. coli DNA polymerase
I (Klenow) in the presence of the appropriate
deoxynucleotide triphosphates (dNTPs). Treatment with
S1 nuclease may also be used,
resulting in the hydrolysis of any single stranded DNA
portions.

Ligations are carried out using standard buffer
and temperature conditions using T4 DNA ligase and ATP;
sticky end ligations require less ATP and less ligase
than blunt end ligations. When vector fragments are used
as part of a ligation mixture, the vector fragment is
often treated with bacterial alkaline phosphatase (BAP)
or calf intestinal alkaline phosphatase to remove the
5'-phosphate and thus prevent religation of the vector;
alternatively, restriction enzyme digestion of unwanted
fragments can be used to prevent ligation. Ligation
mixtures are transformed into suitable cloning hosts,
such as E. coli, and successful transformants selected
by, for example, antibiotic resistance, and screened for
the correct construction.

Synthetic oligonucleotides may be prepared
using an automated oligonucleotide synthesizer as
described by Warner (1984), DNA 3:401. If desired, the
synthetic strands may be labeled with 32P by treatment
with polynucleotide kinase in the presence of 32P-ATP,
using standard conditions for the reaction. DNA
sequences, including those isolated from cDNA libraries,
may be modified by known techniques, including, for
example, site directed mutagenesis, as described by Zoller
(1982), Nucleic Acids Res. 10:6487.

DNA libraries may be probed using the procedure
of Grunstein and Hogness (1975), Proc. Natl. Acad. Sci.
USA 72:3961. Briefly, in this procedure, the DNA to be
probed is immobilized on nitrocellulose filters,
denatured, and prehybridized with a buffer. The
percentage of formamide in the buffer, as well as the
time and temperature conditions of the prehybridization
and subsequent hybridization steps, depend on the
stringency required. Oligomeric probes which require
lower stringency conditions are generally used with low
percentages of formamide, lower temperatures, and longer
hybridization times. Probes containing more than 30 or
40 nucleotides, such as those derived from cDNA or genomic
sequences, generally employ higher temperatures, e.g.,
about 40-42°C, and a high percentage, e.g., 50%,
formamide. Following prehybridization, 5'-32P-labeled
oligonucleotide probe is added to the buffer, and the
filters are incubated in this mixture under hybridization
conditions. After washing, the treated filters are
subjected to autoradiography to show the location of the
hybridized probe; DNA in corresponding locations on the
original agar plates is used as the source of the desired
DNA.

An enzyme-linked immunosorbent assay (ELISA)
can be used to measure either antigen or antibody
concentrations. This method depends upon conjugation of
an enzyme to either an antigen or an antibody, and uses
the bound enzyme activity as a quantitative label. To
measure antibody, the known antigen is fixed to a solid
phase (e.g., a microplate or plastic cup), incubated with
test serum dilutions, washed, incubated with
anti-immunoglobulin labeled with an enzyme, and washed
again. Enzymes suitable for labeling are known in the
art, and include, for example, horseradish peroxidase.
Enzyme activity bound to the solid phase is measured by
adding the specific substrate, and determining product
formation or substrate utilization colorimetrically. The
enzyme activity bound is a direct function of the amount
of antibody bound.

To measure antigen, a known specific antibody
is fixed to the solid phase, the test material containing
antigen is added, after an incubation the solid phase is
washed, and a second enzyme-labeled antibody is added.
After washing, substrate is added, and enzyme activity is
estimated colorimetrically, and related to antigen
concentration.
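
One simple way of relating a colorimetric reading to
concentration is to interpolate between standards of known
concentration run in the same assay. The sketch below uses
piecewise-linear interpolation; the function name, the units
and the standard values are hypothetical and are not taken
from this specification, and practical assays often fit a
nonlinear curve instead.

```python
def interpolate_concentration(absorbance, standards):
    """Estimate concentration from absorbance by linear interpolation.

    standards: list of (concentration, absorbance) pairs in which absorbance
    increases with concentration. Readings outside the range of the standards
    raise ValueError rather than being extrapolated.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    if len(points) < 2:
        raise ValueError("at least two standards are required")
    if not points[0][1] <= absorbance <= points[-1][1]:
        raise ValueError("reading lies outside the standard curve")
    for (c0, a0), (c1, a1) in zip(points, points[1:]):
        if a0 <= absorbance <= a1:
            if a1 == a0:
                return c0
            return c0 + (c1 - c0) * (absorbance - a0) / (a1 - a0)
    raise ValueError("reading could not be placed on the standard curve")


if __name__ == "__main__":
    # Hypothetical standards: (concentration in arbitrary units, absorbance).
    curve = [(0.0, 0.0), (10.0, 1.0), (20.0, 2.0)]
    print(interpolate_concentration(0.5, curve))
```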

Examples

I

This example describes the cloning of the HCV/J1 and
HCV/J7 nucleotide sequences.

Both blood samples which were used as a source
of HCV virions were found to be positive in an anti-HCV
antibody assay. The HCV isolates from these samples were
named HCV/J1 and HCV/J7. The infectivity of the blood
sample containing the J1 isolate was confirmed by a
prospective study of blood transfusion recipients. Dr.
Tohru Katayama from the Department of Surgery at the
National Tokyo Chest Hospital collected blood from
patients who had contracted post-transfusion non-A,
non-B hepatitis. He also collected blood samples from the
respective blood donors of these patients. Next, these
samples were assayed for antibodies to the C100-3 HCV1
antigen (EPO Pub. No. 318,216), and blood from one of the
donors was found to be positive.

Isolation of the RNA from the blood samples
began by pelleting virions in the blood sample by
ultracentrifugation [Bradley, D.W., McCaustland, K.A.,
Cook, E.H., Schable, C.A., Ebert, J.W. and Maynard
(1985) Gastroenterology 88, 773-779]. RNA was then
extracted from the pellet by the guanidinium/cesium
chloride method [Maniatis, T., Fritsch, E.F., and
Sambrook, J. (1982) "Molecular Cloning: A Laboratory
Manual", Cold Spring Harbor Laboratory, Cold Spring
Harbor] and further purified by phenol/chloroform
extraction in the presence of urea [Berk, A.J., Lee, F.,
Harrison, T., Williams, J. and Sharp, P.A. (1979) Cell
17, 935-944].

Five pairs of synthetic oligonucleotide primers
were designed from the C/E, E, E/NS1, NS3, and NS5
domains of the nucleotide sequence of HCV1 to isolate
fragments from the J1 and J7 genomes. The first set of
primers was used to isolate the sequence from the core
and some of the envelope domain. The second set of
primers was used to isolate the sequences in the envelope
domain. The third set of primers was used to isolate a
fragment which overlapped the putative envelope and
non-structural one, NS1, domains. The fourth and fifth
sets of primers were used to isolate fragments from
non-structural domains three and five, NS3 and NS5. The
sequences for the various primers are shown below:

The sequence of the primers for the C/E region were:
21S 5' CGTGCCCCCGCAAGACTGCT 3'
J80A 5' CCGTCCTCCAGAACCCGGAC 3'

The sequence of the primers for the E region were:
71S 5' GCCGACCTCATGGGGTACAT 3'
J132A 5' AACTGCGACACCACTAAGGC 3'

The sequence of the primers for the E/NS1 region were:
127S 5' TGGCATGGGATATGATGATG 3'
166A 5' TTGAACTTGTGGTGATAGAA 3'

The sequence of the primers for the NS3 region were:
464S 5' GGCTATACCGGCGACTTCGA 3'
526A 5' GACATGCATGTCATGATGTA 3'

The sequence of the primers for the NS5 region were:
870S 5' GCTGGAAAGAGGGTCTACTA 3'
917A 5' GTTCTTACTGCCCAGTTGAA 3'
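
The primer pairs listed above can be sanity-checked with a short script. The following is a minimal sketch and not part of the described procedure: it copies the ten primer sequences from the list above and reports, for each, its length, its G+C fraction, and its reverse complement (the strand segment to which the primer anneals). The helper names are illustrative.

```python
# Primer sequences copied from the list above, written 5' -> 3'.
PRIMERS = {
    "21S": "CGTGCCCCCGCAAGACTGCT",
    "J80A": "CCGTCCTCCAGAACCCGGAC",
    "71S": "GCCGACCTCATGGGGTACAT",
    "J132A": "AACTGCGACACCACTAAGGC",
    "127S": "TGGCATGGGATATGATGATG",
    "166A": "TTGAACTTGTGGTGATAGAA",
    "464S": "GGCTATACCGGCGACTTCGA",
    "526A": "GACATGCATGTCATGATGTA",
    "870S": "GCTGGAAAGAGGGTCTACTA",
    "917A": "GTTCTTACTGCCCAGTTGAA",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Return the fraction of G and C bases in a DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PRIMERS.items():
    print(f"{name:6s} len={len(seq)} GC={gc_fraction(seq):.0%} "
          f"anneals to 5' {reverse_complement(seq)} 3'")
```

A check of this kind is mainly useful for catching transcription errors: each primer in the list should be a 20-mer containing only A, C, G and T.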

1 µg of the antisense primers, 166A, 526A, or
917A, was added to 10 units of reverse transcriptase
(Biorad) to synthesize cDNA fragments with the isolated
RNA as the template. The cDNA fragments were then
amplified by a standard polymerase chain reaction [Saiki,
R.K., Scharf, S., Faloona, F., Mullis, K.B., Horn, G.T.,
Erlich, H.A., and Arnheim, N. (1985) Science 230, 1350-
1354] after 1 µg of the appropriate sense primer, 21S,
71S, 127S, 464S or 870S, was added.

The cDNA fragments amplified by the PCR method
were gel isolated and cloned by blunt-end ligation into
the SmaI site of pUC119 [Vieira, J. and Messing, J.
(1987) Methods in Enzymology 151, 3-11] or into the SnaBI
site of charomid SB, a derivative of the cloning vector
charomid 9-42 [Saito, I. and Stark, G. (1986) Proc. Natl.
Acad. Sci. USA 83: 8664-8668]. Clones which contain the
fragments of the five viral domains were successfully
constructed.

For each PCR reaction of the two isolates,
J1 and J7, three independent clones from each region,
C/E, E, E/NS1, NS3, and NS5, were sequenced by the
dideoxy chain termination method.

Sequence from all regions except C/E has been
isolated from the J1 isolate. Sequence from only the C/E
region has been isolated from the J7 isolate. [...] are
neither longer nor shorter than what would be predicted
from the HCV1 genome. However, there is heterogeneity
between clones containing sequence from the same region.
[...] a consensus sequence was constructed for
each of the domains, C/E, E, E/NS1, NS3 and NS5, as shown
respectively in Figures 1 through 5. The differences
[...] occur randomly [...]
plasma of a single healthy carrier and that these genomes
are heterogeneous at the nucleotide level.

To clarify this point, it was determined how
many [...] lead to amino acid changes, using the
sequence from the NS3 domain of the J1 isolate as an
example. Out of the five nucleotide differences, three
fall on the third position of the amino acid codon and do
not change the amino acid [...]. [...] fall on the first
position of the amino acid codon and [...] amino acid
residue [...]. When analyzing the nucleotide differences
in other domains, many silent and conserved mutations are
found. These results suggest that nucleotide sequences of
the HCV genomes in the plasma of a single healthy donor
are heterogeneous at the nucleotide level.
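
The kind of analysis described above, in which each nucleotide difference is assigned to a codon position and checked for whether it changes the encoded amino acid, can be sketched as follows. This is an illustrative sketch only: the two short sequences are invented for the example and are not J1 or HCV1 data, the function name is hypothetical, and the standard genetic code is assumed with both sequences in the same reading frame.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_differences(ref, other):
    """List each mismatch between two in-frame coding sequences.

    Returns tuples (nucleotide number, codon position 1-3,
    amino acid in ref, amino acid in other); numbering starts at 1.
    """
    if len(ref) != len(other) or len(ref) % 3:
        raise ValueError("sequences must be in frame and of equal length")
    differences = []
    for i, (a, b) in enumerate(zip(ref, other)):
        if a == b:
            continue
        start = i - i % 3  # first base of the codon containing position i
        differences.append((
            i + 1,
            i % 3 + 1,
            CODON_TABLE[ref[start:start + 3]],
            CODON_TABLE[other[start:start + 3]],
        ))
    return differences


# Invented toy sequences (Ala-Lys-Gly versus Ala-Lys-Ser).
for pos, codon_pos, aa_ref, aa_other in classify_differences("GCTAAAGGT", "GCCAAAAGT"):
    kind = "silent" if aa_ref == aa_other else f"{aa_ref} -> {aa_other}"
    print(f"nucleotide {pos}, codon position {codon_pos}: {kind}")
```

In the toy example, the third-position change is silent and the first-position change alters the amino acid, mirroring the two classes of difference discussed in the text.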

In addition, once the consensus sequences for
each of the fragments were compiled, each sequence was
compared to the HCV1 isolate in Figures 6 through 10.
In Figure 6, the fragment from the C/E region of the J7
isolate shows a 92.8%, 512/552, nucleotide and 97.4%,
150/154, amino acid homology to the HCV1 isolate. The
fragment from the E domain of J1 shows a slightly lower
nucleotide and amino acid homology to HCV1 in Figure 7, of
76.2% and 82.9%, respectively. The fragment from the J1
isolate which overlaps the envelope and non-structural
one domains shows the lowest homology to HCV1, as seen in
Figure 8, where the J1 isolate has a 71.5% nucleotide
homology and a 73.5% amino acid homology to HCV1. Figure
9 shows a comparison of the fragment from the NS3 domain
of J1 to HCV1. The homology between the nucleotide
sequences is 79.8%, while the amino acid homology between
the isolates is quite high, 92.2% or 179/194 amino acids.
Figure 10 shows the homology between the NS5 sequences
from J1 and HCV1. The sequences have an 84.3% nucleotide
and 88.7% amino acid homology.
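
The homology figures quoted above are ratios of identical positions to compared positions. As a minimal sketch (the function name is illustrative and the two short sequences are invented, not taken from the figures), percent identity over an ungapped alignment can be computed as follows; the last two lines reproduce the C/E arithmetic from the counts given in the text.

```python
def percent_identity(seq_a, seq_b):
    """Return (matches, compared, percent) for two aligned, ungapped sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return matches, len(seq_a), 100.0 * matches / len(seq_a)


# Invented toy alignment: 7 of 8 positions identical.
print(percent_identity("ACGTACGT", "ACGAACGT"))

# Counts quoted in the text for the C/E fragment of J7 versus HCV1.
print(f"{100 * 512 / 552:.1f}% nucleotide")
print(f"{100 * 150 / 154:.1f}% amino acid")
```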
The vectors described in the examples above
were deposited with the Patent Microorganism Depository,
Fermentation Institute, Agency of Industrial Science and
Technology at 1-3, Higashi 1-chome, Tsukuba-shi, Ibaragi-
ken 305, Japan, and will be maintained under the
provisions of the Budapest Treaty. The accession numbers
and dates of the deposit are listed below.

III

An HCV/J1 clone, J1-1519, was isolated using
essentially the techniques described above. However,
the primers used in the isolation were J159S and 199A.
The sequences of the oligomeric primers J159S and 199A,
which follow, were based upon those in J1-1216 and in
HCV1.

J159S 5' ACT GCC CTG AAC TGC AAT GA 3'

199A 5' AAT CCA GTT GAG TTC ATC CA 3'

Clone J1-1519 is comprised of an HCV cDNA
sequence of 367 nucleotides which spans most of the 5'-
half of the NS1 region and which overlaps the E-region
clone, J1-1216, by 31 nucleotides. Three independent
clones spanning this region were sequenced; the sequences
in this region obtained from the three clones were
identical. The sequence of the HCV cDNA in J1-1216
(shown in the figure as J1) and the amino acids encoded
therein (shown above the nucleotide sequence) are shown
in Figure 13. Figure 13 also shows the sequence
differences between J1-1216 and the comparable region of
the prototype HCV1 cDNA (indicated in the figure as PT),
and the resulting changes in the encoded amino acids.
The homology between the J1-1216 and HCV1 cDNA is
approximately 70% at the nucleotide level, and about 75%
at the amino acid level.

A composite of the sequences from the putative
core to NS1 region of the J1 isolate is shown in Figure
14; also shown in the figure are the amino acids encoded
in the J1 sequence. The variation from the HCV1
prototype sequence is shown in the line below the J1
nucleotide sequence; the dashed lines indicate homologous
sequences. The nonhomologous amino acid encoded in the
HCV1 prototype sequence is shown below the HCV1
nucleotide sequence.
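
A composite of this kind is built by joining clones at their regions of overlap (for example, the 31-nucleotide overlap between J1-1216 and J1-1519 noted above). The following is a minimal sketch of that joining step under simplifying assumptions: the overlap is taken to be an exact match, the function name is hypothetical, and the short sequences are invented for illustration rather than taken from the clones.

```python
def merge_overlapping(upstream, downstream, min_overlap=1):
    """Join two fragments where the 3' end of `upstream` exactly matches
    the 5' end of `downstream`.

    Returns (composite, overlap_length) using the longest exact overlap.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest = min(len(upstream), len(downstream))
    for k in range(longest, min_overlap - 1, -1):
        if upstream[-k:] == downstream[:k]:
            return upstream + downstream[k:], k
    raise ValueError("no exact overlap of the required length found")


# Invented toy fragments sharing the four bases TGCA.
composite, overlap = merge_overlapping("ACGTTGCA", "TGCAGGTT")
print(composite, overlap)
```

Requiring a minimum overlap (for the clones above, a value near the reported 31 nucleotides) guards against joining fragments on a short chance match.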

Cloned material containing the J1-1519 HCV cDNA
(pS1-1519) has been maintained in DH5α, and deposited
with the Patent Microorganism Depository.

IV

Several regions of the J1 isolate,
including the C200-C100 region from the putative NS3-NS4
region (which encompasses the region encoding the 5-1-1
polypeptide in HCV1; see EPO Pub. No. 318,216), and the
putative NS1 - E region, were amplified using the PCR
method. The C200-C100 region includes nucleotides 3799
to 5321 of the prototype HCV1. RNA was extracted as
described above, except that extraction was with
guanidinium thiocyanate in the presence of Proteinase K
and sodium dodecylsulfate (SDS) (Maniatis (1982), supra).
The RNA was transcribed into HCV cDNA by incubation in a
25 µl reaction comprised of 1 µM of each primer, 40 units
of RNase inhibitor (RNASIN), 5 units of AMV reverse
transcriptase, and salts and buffer necessary for the
reaction. Amplification of a segment of the HCV cDNA
from the designated region was performed utilizing pairs
of synthetic oligomer 16-mer primers. PCR amplification
was accomplished in three rounds (PCR I, PCR II, and
PCR III). The second and third rounds of PCR
amplification (PCR II and PCR III) utilized different
sets of PCR primers; the first PCR reaction was diluted
10-fold and multiple rounds of PCR amplification were
carried out with the new primers, so that ultimately up
to 50% of the products of the first PCR reaction (PCR I)
were reamplified. The primers used for the amplification
of the regions were the following. These primers, with
the exception of J1C200-3 which was derived from the J1
isolate sequence, were derived from the prototype HCV1
sequence.

Primers for amplification of the "5-1-1" region
from NS3-NS4.

PCR I

511/16A (sense, derived from nucleotides starting at
number 1528 of HCV1)

5' AAC AGG CTG CGT GGT C 3'

511/16B (anti-sense, derived from nucleotides ending at
number 5260 of HCV1)

5' AGT TGG TCT GGA CAG C 3'

PCR II

[...] (sense, the HCV portion derived from nucleotides
starting at number 5057 of HCV1; the restriction enzyme
site is underlined)

[...] (anti-sense, the HCV portion derived from
nucleotides ending at number 5233 of HCV1; the
restriction enzyme site is underlined)

5' CCT CTG OCT GAG GGG ACG CGG TCT GC 3'

PCR III

[...] (see supra)

[...] (antisense, derived from nucleotides ending at
number 5804 of HCV1)

5' GTA GTG CGT GGG GGA AAC AT 3'

Primers for amplification of the "NS1 - E" region

PCR I

[...] (sense, the HCV portion derived from nucleotides
starting at number 953 of HCV1, the restriction enzyme
site is underlined)

5' CTTAGAATTC TGG CAT GGG ATA TGA TGA TG 3'

[...] (sense, the HCV portion derived from nucleotides
starting at number 1087 of HCV1, the restriction enzyme
site is underlined)

5' CTTAGAATTC TCC ATG GTG GGG AAC TGG GC 3'

[...] (anti-sense, the HCV portion derived from
nucleotides ending at 1995 of HCV1, the restriction
enzyme site is underlined)

5' [...] TAA CGG GCT GAG CTC GGA 3'

[...] (anti-sense, the HCV portion derived from
nucleotides ending at 1941 of HCV1, the restriction
enzyme site is underlined)

5' CTTAGAATTC CGT CCA GTT GCA GGC AGC TTC 3'

PCR II

J1rc13 (see supra)

J1IZ-1 (sense, the HCV portion is derived from
nucleotides starting at number 1641 of HCV1, the
restriction enzyme site is underlined)

5' CTTGAATTC CAA CTG GTT CGG CTG TAG A 3'

J1IZ-2 (sense, the HCV portion is derived from
nucleotides starting at number 1596 of HCV1, the
restriction enzyme site is underlined)

5' TGA GAC GGA CGT GCT GCT CCT 3'
Primers for the C200-C100 region of the "NS3-NS4" region

PCR I

J1C200-1 (sense, derived from nucleotides starting at
number 3478 of HCV1)

5' TCC TAC TTG AAA GGC TC 3'

J1C200-3 (anti-sense, derived from nucleotides ending at
number 4402 of HCV1)

5' GGA TCC AAG CTG AAA TCG AC 3'

J1rc52 (anti-sense, the HCV portion derived from
nucleotides ending at 5853 of HCV1, the restriction
enzyme site is underlined)

5' CTTAGAATTC GAG GCT GCT GAG ATA GGC AGT 3'

511/16A (see above).

PCR II

J1C200-2 (sense, the HCV portion derived from nucleotides
starting at number 3557 of HCV1, the restriction enzyme
site is underlined)

5' CTTGAATTC CCC GTG GAG TGG CTA AGG CGG TGG ACT 3'

J1C200-4 (anti-sense, the HCV portion derived from
nucleotides ending at 4346 of HCV1, the restriction
enzyme site is underlined)

5' CTTGAATTC TCG AAG TCG CCG GTA TAG CCG GTC ATG 3'

511/35A (see above)

[...] (anti-sense, the HCV portion derived from
nucleotides ending at 5826 of HCV1, the restriction
enzyme site is underlined)

5' [...] GGC AGC TGC ATC GCT CTC CGG CAC 3'
The amplified HCV cDNAs were either sequenced directly
without cloning, and/or were cloned. Sequencing was
accomplished using an asymmetric PCR technique
essentially as described in Shyamala and Ames, J. [...].
[...] is carried out with a [...] one of the primers
(usually in a ratio [...]) [...] dideoxy chain
termination method.

The primers used for [...] the PCR reaction were the
following. For the NS1 region: [...] on both strands.
For the NS3-NS4 region, [...] the C200-C100 [...]
terminal region, [...] C200-C100 C-terminal region, and
the 5-1-1 region: [...] and J1C200-[...] (for the
N-terminal region of C200-[...]); J1C200-4 and
J1C200-[...] (for the C200-C100 C-terminal region); and
511/35A and hep 4 (for the 5-1-1 region).

[...] (derived from nucleotides starting at [...])

5' [...] GGC TAG TOG TTA CTG GGC TGG TGA CAG 3'

[...] (the HCV portion [...])

5' [...] CGT ACT CCA CCT ACG GCA AGT TCC TT 3'

J1C200-7 (the HCV portion derived from nucleotides
starting at number 3946 of HCV1, the restriction enzyme
site is underlined)

5' CTTGAATTC GTG GCA TCC GTG GAG TGG CAC TCG TC 3'
The sequences obtained by asymmetric sequencing
of the "NS1" region, the C200-C100 region, and the 5-1-1
region are shown in Figure 15, and Figure 16,
respectively. In the figures, the amino acids encoded in
the J1 sequence are shown above the J1 nucleotide
sequence. The differences between the J1 sequence and
the HCV1 prototype nucleotide sequence are shown below the
J1 sequence (the dashes indicate homologous nucleotides
in both sequences). The encoded amino acids which differ
in the HCV1 prototype sequence are shown below the HCV1
nucleotide sequence.
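
The display convention described above, in which the prototype sequence is printed below the isolate with a dash wherever the two agree, can be sketched in a few lines. This is an illustration only; the function name is hypothetical and the short sequences are invented, not taken from the figures.

```python
def difference_line(isolate, prototype, same="-"):
    """Render the prototype under the isolate, showing a dash where the
    two aligned sequences are identical and the prototype base otherwise."""
    if len(isolate) != len(prototype):
        raise ValueError("sequences must be aligned to equal length")
    return "".join(same if i == p else p for i, p in zip(isolate, prototype))


# Invented toy sequences standing in for the isolate and the prototype.
isolate = "ACGTAC"
prototype = "ACCTAA"
print("J1  ", isolate)
print("HCV1", difference_line(isolate, prototype))
```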

HCV cDNAs from the NS1 region, the C200-C100
region, and the 5-1-1 region were cloned. A 300 bp and a
230 bp fragment from the putative NS1 region were cloned
into a derivative of the commercially available vector,
pGEM-3Z, in host HB101, and deposited with the ATCC as
AW-300bp. The derivative vectors maintain the original
pGEM-3Z polylinkers, an intact Amp^r gene, and the genes
required for replication in E. coli. The HCV cDNA
fragments may be removed with SacI and XbaI. HCV cDNAs
containing 770 bp N-terminal fragments of C200 were
cloned into pMIE in HB101; 12 clones were pooled and
deposited with the ATCC as AW-770bp-N; the HCV cDNA may
be removed from the vector with HaeII. The resultant
HaeII fragment will contain vector DNA of 300 bp and 250
bp at the 5' and 3' ends, respectively. HCV cDNAs
containing 700 bp C-terminal fragments of C200 (AW-
700bp-C) were cloned into M13mp10 and maintained in host
DH5α-F'; cloning was into the vector polylinker site.
The resultant phage were pooled, and deposited with the
ATCC on September 11, 1990 as AW-700bp-N or AW-700bp-C.
HCV cDNA from J1 equivalent to the 5-1-1 region of HCV1
was cloned into the mp19 RI site [...] maintained in
DH5α-F'. Several M13 phage supernatants from this
cloning were pooled and deposited with the ATCC as
J1 5-1-1 on [...] 1990. The HCV cDNAs may be removed
from [...] EcoRI. Accession numbers for J1 5-1-1 and
AW-700bp-N or AW-700bp-C may be obtained by telephoning
the ATCC at (301) 881-[...].

The above-described cloned material was
deposited with the American Type Culture Collection.

V

An HCV cDNA library containing sequences of
the putative "NS1" region of the J1 isolate was created
by directional cloning in λ-gt22. The "NS1" region
extends from about nucleotide 1460 to about nucleotide
2730 using the numbering system of the HCV1 prototype
nucleic acid sequence, where nucleotide 1 is the first
nucleotide of the initiating methionine codon for the
putative polyprotein. The cloning was accomplished using
essentially the method described by Han and Rutter in
Genetic Engineering, Vol. 10 (J.K. Setlow, Plenum
Publishing Co., 1988), except that the primers for the
synthesis of the first and second strand of HCV cDNA were
JHC67 and JHC68, respectively, and the source of RNA was
the J1 plasma. In this method the RNA is extracted with
guanidium thiocyanate at a low temperature. The RNA is
then converted to full length cDNA, which is [...]
defined orientation relative to the [...] promoter in
λ-phage. Using this method, the HCV cDNAs to J1 RNA were
inserted into the NotI site of λ-gt22. The presence of
[...] in the library [...].

The sequence of a region of "NS1" downstream
from the region shown in Figure 14, but which overlaps
the region by about 20 nucleotides, was determined using
the asymmetric sequencing technique described above, but
substituting as primers for PCR amplification ALX 61 and
ALX 62. The resulting sequence is shown in Figure 17.
(It should be noted that the PCR amplification was of a
region from about nucleotide 1930 to about nucleotide
2340; this region is also encompassed in the sequence
shown in Figure 15.) The sequences of the primers and
probes used to obtain the HCV cDNA library in λ-gt22, and
to sequence the portion of the "NS1" region, were the
following.
JHC 67

5' GACGC GGCCG CCTCC GTGTC CAGCG CGT 3'

JHC 68

5' CGTGC GGCCG CAAGA CTGCT AGCCG AGGT 3'

ALX 61

5' ACCTG CCACT GTGTA GTGGT CAGCA GTAAC 3'

ALX 62

5' ACGGA CGTCT TCGTC CTTAA CAATA CCAGG 3'

ALX 54

5' GAACT TTGCG ATCTG GAAGA CAGGG ACAGG 3'
A 400 bp fragment of J1 HCV cDNA derived from
the sequenced region was cloned into pGEM3Z and
maintained in HB101; the HCV cDNA may be removed from the
vector with SacI and XbaI. Host cells transformed with
the vector (JH-400bp) have been deposited with the ATCC.

A pooled cDNA library was created from the J1
serum; the pooled library spans the J1 genome and is
identified as HCV-J1 λ gt22. The pooled cDNA library was
created by pooling aliquots of 11 individual cDNA
libraries, which had been prepared using the directional
cloning technique described above, except that the
libraries were created from primers which were designed
to yield HCV cDNAs which spanned the genome. The primers
were derived from the sequence of HCV1, and included JHC
67 and JHC 68. The HCV cDNAs were inserted into the NotI
site of λ-gt22. The pooled cDNA library, HCV-J1 λ gt22,
has been deposited with the ATCC.

VI

The sequence of a region of the polynucleotide
upstream of that shown in Figure 14 was determined. This
region begins at nucleotide -267 with respect to HCV-1
(see Figure 12) and extends for 560 nucleotides.
Sequencing was accomplished by preparing HCV cDNA from
RNA extracted from J1 serum, and amplifying the HCV cDNA
using the PCR method.

RNA was extracted from 100 µl of serum
following treatment with proteinase K and sodium
dodecylsulfate (SDS). The samples were extracted with
phenol-chloroform, and the RNA precipitated with ethanol.

HCV cDNA from the J1 isolate was prepared by
denaturing the precipitated RNA with 0.01M MeHgOH; after
ten minutes at room temperature, 2-mercaptoethanol was
added to sequester the mercury ions. Immediately, the
mix for the first strand of cDNA synthesis was added, and
incubation was continued for 1 hr at 37°C. The
conditions for the synthesis of the anti-sense strand
were the following: 50 mM Tris HCl, pH 8.3, 75 mM KCl, 3
mM MgCl2, 10 mM dithiothreitol, 500 µM each
deoxynucleotide triphosphate, 250 pmol specific antisense
cDNA primer r25, 250 units MMLV reverse transcriptase.
In order to synthesize the second strand (sense), the
synthesis reaction components were added, and incubated
for one hour at 14°C. The components for the second
strand reaction were as follows: 14 mM Tris HCl, pH 8.3,
68 mM KCl, 7.5 mM ammonium sulfate, 3.5 mM MgCl2, 2.8 mM
dithiothreitol, 25 units DNA polymerase I, and one unit
RNase H. The reactions were terminated by heating the
samples to 95°C for 10 minutes, followed by cooling on
ice.

The HCV cDNA was amplified by two rounds of
PCR. The first round was accomplished using 20 µl of the
cDNA mix. The conditions for the PCR reaction were as
follows: 10 mM Tris HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2,
0.002% gelatin, 200 µM each of the deoxynucleotide
triphosphates, and 2.5 units Amplitaq. The PCR thermal
cycle was as follows: 94°C one minute, 50°C one minute,
72°C one minute, repeated 40 times followed by seven
minutes at 72°C. The second round of PCR was
accomplished using nested primers (i.e., primers which
bound to an internal region of the first round of PCR
amplified product) to increase the specificity of the PCR
products. One percent of the first PCR reaction was
amplified essentially as the first round, except that the
primers were substituted, and the second step in the PCR
reaction was at 60°C instead of 50°C. The primers used
for the first round of PCR were ALX90 and r14. The
primers used for the second round of PCR were r14 and
p14.
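
The two thermal programs described above differ only in the temperature of the second step. As a minimal sketch, they can be written down as data and the programmed hold time totalled. The dictionary layout and function names are illustrative assumptions, and the total excludes the time the instrument spends ramping between temperatures.

```python
# First-round program as described: (temperature in deg C, minutes) per step.
FIRST_ROUND = {
    "steps": [(94, 1), (50, 1), (72, 1)],
    "cycles": 40,
    "final_extension": (72, 7),
}


def with_second_step_temperature(program, temperature_c):
    """Copy a program, changing only the temperature of its second step."""
    steps = list(program["steps"])
    steps[1] = (temperature_c, steps[1][1])
    return {**program, "steps": steps}


def total_hold_minutes(program):
    """Sum the programmed hold times (ramping between steps not included)."""
    per_cycle = sum(minutes for _, minutes in program["steps"])
    return program["cycles"] * per_cycle + program["final_extension"][1]


# Second (nested) round: same program with the second step at 60 deg C.
SECOND_ROUND = with_second_step_temperature(FIRST_ROUND, 60)

print(total_hold_minutes(FIRST_ROUND))   # 40 cycles x 3 min + 7 min = 127
print(SECOND_ROUND["steps"])
```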

The sequences of the primers for the synthesis
of HCV cDNA and for the PCR method were the following.

r25

5' ACC TTA CCC AAA TTG CGC GAC CTA 3'

ALX90

5' CCA TGA ATC ACT CCC CTG TGA GGA ACT A 3'

r14

5' GGG CCC CCAG CTA GGC CGA GA 3'

p14

5' AAC TAC TGT CTT CAC GCA GAA AGC 3'
The PCR products were gel purified; the
material which migrated as having about 615 bp was
isolated and sequenced by a modification of the Sanger
dideoxy chain termination method, using 32P-ATP as [...].
In the modified method, the sequence replication was
[...] P32 and R31 primers; the double stranded
DNA was melted for 3 minutes at 95°C prior to
replication, and the synthesis of labeled dideoxy
terminated polynucleotides was catalyzed by Bst
polymerase (obtained from BioRad Corp.), according to the
[...] using [...] to 1 µg of [...] product.

The primers P32 (sense) and R31 (antisense)
were derived from nucleotides [...] to [...] and
nucleotides [...] to [...], respectively, of the HCV1
sequence. The sequences of the primers are the
following.

P32 primer

5' AAC CCG CTC AAT GCC TGG AGA TT 3'

R31 primer

5' GGC CGX CGA GCC TTG GGG AT 3'
where X = A or G
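
Because R31 contains one position that may be either of two bases, it stands for a mixture of two oligonucleotides. The short sketch below enumerates them. The mapping uses the symbol X exactly as defined in the text (in IUPAC notation an A-or-G position is normally written R); the function and variable names are illustrative.

```python
from itertools import product

# Degenerate symbol as defined in the text: X = A or G.
DEGENERATE = {"X": "AG"}


def expand_degenerate(primer):
    """Return every concrete sequence represented by a degenerate primer."""
    options = [DEGENERATE.get(base, base) for base in primer]
    return ["".join(choice) for choice in product(*options)]


# R31 as listed above, written without spaces.
R31 = "GGCCGXCGAGCCTTGGGGAT"
for sequence in expand_degenerate(R31):
    print(sequence)
```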



[...] the sequence of the region in the J1 isolate
which encompasses the 5'-untranslated region as well as a
[...]. In the figure, the amino acids encoded in the J1
sequence are shown above the nucleotide sequence. The
sequence of the prototype HCV1 is shown below the J1
sequence; the dashes indicate sequence homology with J1.
[...] are shown below the HCV1 sequence.

An HCV cDNA fragment which is representative
of the 600 bp J1 sequence described above (TC 600bp) was
cloned into pGEM3Z and maintained in host HB101; the HCV
cDNA fragment may be removed with SacI and XbaI. This
material is on deposit with the ATCC.



Patent Microorganism Depository; deposited under Budapest
Treaty terms.

Deposited Materials          Accession Number   Deposit Date
E. coli DH5/pS1-8791a        BP-2593            9/15/1989
  (This clone contains 427 bp of the NS5 domain of J1)
E. coli HB101/pU1-1216c      BP-2594            9/15/1989
  (This clone contains 351 bp of the E/NS1 domains of J1)
E. coli HB101/pU1-4652d      BP-2595            9/15/1989
  (This clone contains 583 bp of the NS3 domain of J1)
E. coli DH5α/pS1-713c        BP-2637            11/1/1989
  (This clone contains 580 bp of the E domain of J1)
E. coli DH5α/pS7-28c         BP-2638            11/1/1989
  (This clone contains 552 bp of the C/E domain of J7)
E. coli DH5α/pS1-1519        BP-3081            8/30/90



The following vectors described in the Examples
were deposited with the American Type Culture Collection
(ATCC), 12301 Parklawn Dr., Rockville, Maryland 20852,
and have been assigned the following Accession Numbers.
The deposits were made under the terms of the Budapest
Treaty.

Deposited Materials                     Accession Number   Deposit Date
TC-600BP (in E. coli HB101/pGEM3Z)      68393              9/11/90
JH-400bp (in E. coli HB101/pGEM3Z)      68394              9/11/90
AW-300bp (in E. coli HB101/pGEM3Z)      68392              9/11/90
AW-770bp-N (in E. coli HB101/pMIE)      68395              9/11/90
AW-700bp-C or AW-700bp-N
  (in E. coli DH5α-F'/M13mp10)
J1 5-1-1 (in E. coli DH5α-F'/M13mp10)
HCV-J1 λ gt22                           40884              9/6/90

These deposits are provided for the convenience of those
skilled in the art. These deposits are neither an
admission that such deposits are required to practice the
present invention nor that equivalent embodiments are not
within the skill of the art in view of the present
disclosure. The public availability of these deposits is
not a grant of a license to make, use or sell the
deposited materials under this or any other patent. The
nucleic acid sequences of the deposited materials are
incorporated into the present disclosure by reference, and
are controlling if in conflict with any sequences
described herein.

While the present invention has been described
by way of specific examples for the benefit of those in
the field, the scope of the invention is not limited, as
additional embodiments will be apparent to those of skill
in the art from the present disclosure.

CLAIMS



1. A DNA molecule comprising a nucleotide
sequence of at least 15 bp from an HCV isolate
substantially homologous to an isolate selected from the
group J1 and J7, wherein said nucleotide sequence is
distinct from the nucleotide sequence of HCV isolate
HCV1.

2. A DNA molecule comprising a nucleotide
sequence of at least 15 bp encoding an amino acid
sequence from the HCV isolate J1 or J7 wherein the J1 or
J7 amino acid sequence is distinct from the amino acid
sequence of HCV isolate HCV1.

3. A DNA molecule according to claim 2 wherein
the J1 or J7 amino acid sequence comprises a
substantially complete viral polypeptide.

4. A DNA molecule according to claim 2 wherein
the J7 amino acid sequence is amino acid 1 to amino acid
115.

5. A DNA molecule according to claim 1 wherein
the J1 amino acid sequence is from amino acid 116 to
amino acid 350.

6. A DNA molecule according to claim 2 wherein
the J1 amino acid sequence is from amino acid 351 to
amino acid 651.

7. A DNA molecule according to claim 2 wherein
the J1 amino acid sequence is from amino acid 1007 to
amino acid 1650.

8. A DNA molecule according to claim 2 wherein
the J1 amino acid sequence is from amino acid 2100 to the
end of the coding sequence.

9. A purified polypeptide comprising an amino
acid sequence from an HCV isolate substantially
homologous to an isolate selected from the group
consisting of J1 and J7 wherein the amino acid sequence
is distinct from the sequence of the polypeptides encoded
by the HCV isolate HCV1.

10. A purified polypeptide according to claim
9 wherein the J1 or J7 amino acid sequence comprises an
epitope that is not immunologically cross-reactive with
any HCV1 epitope.

11. A purified polypeptide according to claim
9 wherein the J7 amino acid sequence is from amino acid 1
to amino acid 115.

12. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
116 to amino acid 350.

13. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
351 to amino acid 651.

14. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
1007 to amino acid 1650.

15. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
2100 to the end of the coding sequence.

16. A polypeptide comprising an amino acid
sequence from an HCV isolate selected from the group
consisting of J1 and J7 wherein the J1 or J7 amino acid
sequence is distinct from the amino acid sequence of HCV
isolate HCV1 and the polypeptide is immobilized on a
solid support.

17. An immunoassay for detecting the presence
of anti-HCV antibodies in a test sample comprising:

(a) incubating the test sample under
conditions that allow the formation of antigen-
antibody complexes with an antigenic
polypeptide comprising an amino acid sequence
from an HCV isolate selected from the group
consisting of J1 and J7, wherein the amino acid
sequence is distinct from the amino acid
sequence of HCV isolate HCV1; and (b) the J1 or
J7 amino acid sequence is not immunologically
cross-reactive with HCV1; and

(b) detecting an antigen-antibody complex
comprising the antigenic polypeptide.

18. An immunoassay according to claim 17
wherein the J1 amino acid sequence is from a viral domain
selected from the group consisting of amino acid sequence
from amino acid 116 to amino acid 350, from amino acid
351 to amino acid 651, from amino acid 1007 to amino acid
1650, and from amino acid 2100 to the end of the coding
sequence.

19. An immunoassay according to claim 17
wherein the J1 amino acid sequence is from amino acid 1
to amino acid 115.

20. An immunoassay according to claim 17
wherein the test sample comprises human blood or a
fraction thereof.



21. A composition comprising anti-HCV
antibodies that bind an HCV epitope substantially free of
antibodies that do not bind an HCV epitope, wherein:

(a) the HCV epitope comprises an amino acid
sequence from an HCV isolate selected from the
group consisting of J1 and J7;

(b) the J1 or J7 amino acid sequence is
distinct from the amino acid sequence of HCV
isolate HCV1; and

(c) the J1 or J7 amino acid sequence is not
immunologically cross-reactive with HCV1.

22. A composition according to claim 21
wherein the anti-HCV antibodies are polyclonal.

23. A composition according to claim 21
wherein the anti-HCV antibodies are monoclonal.

24. An immunoassay for detecting the presence
of an HCV polypeptide in a test sample comprising:

(a) incubating the test sample under
conditions that allow the formation of antigen-
antibody complexes with anti-HCV antibodies
that bind an HCV epitope wherein:

(i) the HCV epitope comprises an amino
acid sequence from an HCV isolate selected
from the group consisting of J1 and J7;

(ii) the J1 or J7 amino acid sequence is
distinct from the amino acid sequence of
HCV isolate HCV1; and

(iii) the J1 or J7 amino acid sequence is
not immunologically cross-reactive with
HCV1; and

(b) detecting an antigen-antibody complex
comprising the anti-HCV antibodies.

25. An immunoassay according to claim 24
wherein the J1 amino acid sequence is from a viral domain
selected from the group consisting of amino acid sequence
from amino acid 116 to amino acid 350, from amino acid
351 to amino acid 651, from amino acid 1007 to amino acid
1650, and from amino acid 2100 to the end of the coding
sequence.

26. An immunoassay according to claim 24
wherein the J7 amino acid sequence is from amino acid 1
to amino acid 115.

27. A method of producing anti-HCV antibodies
comprising administering to a mammal a polypeptide
comprising an amino acid sequence from an HCV isolate
selected from the group consisting of J1 and J7 wherein
the J1 or J7 amino acid sequence is distinct from the
amino acid sequence of HCV isolate HCV1 whereby the
mammal produces anti-HCV antibodies.

28. A method of detecting HCV polynucleotides
in a test sample comprising:

(a) providing a probe comprising the DNA
molecule of claim 1;

(b) contacting the test sample and the probe
under conditions that allow for the formation
of a polynucleotide duplex between the probe
and its complement in the absence of
substantial polynucleotide duplex formation
between the probe and non-HCV polynucleotide
sequences present in the test sample; and

(c) detecting any polynucleotide duplexes
comprising the probe.
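
By way of illustration only, the selective duplex formation recited in claim 28 can be mimicked computationally. The following Python sketch is not part of the claimed method. It assumes exact Watson-Crick complementarity as a crude stand-in for hybridization stringency, and the names `reverse_complement` and `probe_can_pair` are hypothetical. The target fragment is the first 36 nucleotides of the J7 sequence shown in FIG. 1-1.

```python
# Illustrative sketch only: exact complementarity is an assumption that
# stands in for real hybridization conditions (temperature, salt, mismatches).

_PAIR = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(_PAIR[base] for base in reversed(seq.upper()))


def probe_can_pair(probe: str, target: str) -> bool:
    """True if the target contains a stretch exactly complementary to the probe."""
    return reverse_complement(probe) in target.upper()


# First 36 nucleotides of the J7 sequence (FIG. 1-1).
j7_fragment = "AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT"

# A hypothetical 18-mer probe complementary to positions 10-27 of the fragment.
hcv_probe = reverse_complement("GTGTTGGGTCGCGAAAGG")

# An arbitrary unrelated 18-mer, used as a negative control.
unrelated_probe = "ACGTACGTACGTACGTAC"

print(probe_can_pair(hcv_probe, j7_fragment))        # probe finds its complement
print(probe_can_pair(unrelated_probe, j7_fragment))  # no complementary stretch
```

In practice, specificity depends on the wash and hybridization conditions chosen, which a string match of this kind does not model.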

29. A method of producing a recombinant
polypeptide comprising an HCV amino acid sequence, the
method comprising:

(a) providing host cells transformed by a DNA
construct comprising a control sequence for the
host cell operably linked to a coding
sequence encoding an amino acid sequence from
an HCV isolate selected from the group
comprised of J1 and J7 wherein the J1 or J7
amino acid sequence is distinct from the amino acid sequence
of HCV isolate HCV1;

(b) growing the host cells under conditions
whereby the coding sequence is transcribed and
translated into the recombinant polypeptide;
and

(c) recovering the recombinant polypeptide.

30. A biological material derived from the
group consisting of materials deposited under Accession
Numbers BP-2593, BP-2594, BP-2595, BP-2637, BP-2638,
BP-3081, ATCC No. 68392, ATCC No. 68393, ATCC No. 68394,
ATCC No. 68395, and ATCC No. 408884.
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J7 1 AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT 

discrepancy 

clone 

altered aa 

J7 37 ACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGG 

Met Ser Thr Asn 

J7 73 TCTCGTAGACCGTGCATC ATG AGC ACA AAT 

Pro Lys Pro Gln Arg Lys Thr Lys Arg
J7 103 CCT AAA CCC CAA AGA AAA ACC AAA CGT 

T G 
b 1 
Arg 



Asn Thr Asn Arg Arg Pro Gln Asp Val
J7 130 AAC ACC AAC CGT CGC CCA CAG GAC GTT 

C 
b 



Lys Phe Pro Gly Gly Gly Gln Ile Val

J7 157 AAG TTC CCG GGC GGT GGT CAG ATC GTC 

T T 
1 b 
Leu 



Gly Gly Val Tyr Leu Leu Pro Arg Arg
J7 184 GGT GGA GTT TAC TTG TTG CCG CGC AGG

A 
b 



Gly Pro Arg Leu Gly Val Arg Ala Thr 
J7 211 GGC CCC AGG TTG GGT GTG CGT GCG ACT 

FIG. 1-1 
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Arg Lys Thr Ser Glu Arg Ser Gin Pro 
J7 238 AGG AAG ACT TCC GAG CGG TCG CAA CCT 

A 
b 



Arg Gly Arg Arg Gin Pro lie Pro Lys 
J7 265 CGT GGA AGG CGA CAA CCT ATC CCC AAG 

Ala Arg Arg Pro Glu Gly Arg Thr Trp 

J7 292 GCT CGC CGG CCC GAG GGC AGG ACC TGG 

Ala Gin Pro Gly Tyr Pro Trp Pro Leu 
J7 319 GCT CAG CCT GGG TAT CCT TGG CCC CTC 



Tyr Gly Asn Glu Gly Leu Gly Trp Ala 

J7 346 TAT GGC AAT GAG GGC TTG GGG TGG GCA 

A 
b 
END 



Gly Trp Leu Leu Ser Pro Arg Gly Ser 
J7 373 GGA TGG CTC CTG TCA CCC CGC GGC TCT 



Arg Pro Ser Trp Gly Pro Asn Asp Pro 
J7 400 CGG CCT AGT TGG GGC CCC AAT GAC CCC 

T C 
c b 
Thr 



Arg Arg Arg Ser Arg Asn Leu Gly Lys 
J7 427 CGG CGT AGG TCG CGT AAT TTG GGT AAG 



Val Ile Asp Thr Leu Thr Cys Gly Phe
J7 454 GTC ATC GAT ACC CTT ACA TGC GGC TTC

FIG. 1-2

rs Leu 
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Ala Asp Leu Met Gly Tyr lie Pro Leu 
J7 481 GCC GAC CTC ATG GGG TAC ATT CCG CTT 

C C 
c b 



Val Gly Ala Pro Leu Gly Gly Ala Ala 
J7 508 GTC GGC GCC CCC TTA GGG GGC GCT GCC 

Arg Ala Leu Ala His Gly 
J7 535 AGG GCC CTG GCA CAT GGT 

FIG. 1-3 
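
The amino acid line printed above each codon line in the figures follows the standard genetic code. As a minimal sketch, assuming the standard code and using hypothetical names (`CODON_TABLE`, `THREE_LETTER`, `translate`), the codon rows of FIG. 1 can be checked against their amino acid labels as follows. Stop codons are rendered as `END`, matching the label used in FIG. 1-2.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption: the document
# does not state which code table was used, but its labels agree with it).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# One-letter to three-letter names, as printed in the figures.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "END",
}


def translate(dna: str) -> list[str]:
    """Translate a DNA string (spaces allowed) codon by codon.

    Any trailing one or two bases that do not form a full codon are ignored.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)]


# Start of the J7 coding region (FIG. 1-1, positions 91-102 and 103-129).
print(translate("ATG AGC ACA AAT"))
print(translate("CCT AAA CCC CAA AGA AAA ACC AAA CGT"))
```

Comparing such output with the printed labels is a quick way to spot transcription errors in either line of a row.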
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Pro Leu Val Gly Ala Pro Leu Gly Gly 

J1 1 T CCG CTC GTC GGC GCC CCC TTA GGG GGC

discrepancy c 

clone d 

altered aa Ser 

Ala Ala Arg Ala Leu Ala His Gly Val 

29 GCT GCC AGG GCC CTG GCA CAT GGT GTC 



Arg Val Leu Glu Asp Gly Val Asn Tyr 
56 CGG GTT CTG GAG GAC GGC GTG AAC TAT 



Ala Thr Gly Asn Leu Pro Gly Cys Ser 
83 GCA ACA GGG AAT TTG CCC GGT TGC TCT 



Phe Ser lie Phe Leu Leu Ala Leu Leu 
110 TTC TCT ATC TTC CTC TTG GCT CTG CTG 

A T 
g d 



Ser Cys Leu Thr lie Pro Ala Ser Ala 
137 TCC TGT TTG ACC ATC CCA GCT TCC GCT 

Tyr Glu Val Arg Asn Val Ser Gly lie 
164 TAT GAA GTG CGC AAC GTG TCC GGG ATA 



Tyr His Val Thr Asn Asp Cys Ser Asn 
191 TAC CAT GTC ACA AAC GAC TGC TCC AAC 

T 
d 



Ser Ser lie Val Tyr Glu Ala Ala Asp 
218 TCA AGC ATT GTG TAT GAG GCG GCG GAC 



FIG. 2-1 
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Val lie Met His Ala Pro Gly Cys Val 

245 GTG ATC ATG CAT GCC CCC GGG TGC GTG 



Pro Cys Val Arg Glu Asn Asn Ser Ser 
272 CCC TGC GTT CGG GAG AAC AAT TCC TCC 

C 
d 



Arg Cys Trp Val Ala Leu Thr Pro Thr
299 CGT TGC TGG GTA GCG CTC ACT CCC ACG

Leu Ala Ala Arg Asn Ala Ser Val Pro
326 CTC GCG GCC AGG AAT GCC AGC GTC CCC

Thr Thr Thr Leu Arg Arg His Val Asp
353 ACT ACG ACA TTA CGA CGC CAC GTC GAC

G

d

Leu Leu Val Gly Thr Ala Ala Phe Cys
380 TTG CTC GTT GGG ACG GCT GCT TTC TGC

Ser Ala Met Tyr Val Gly Asp Leu Cys
407 TCC GCT ATG TAC GTG GGG GAT CTC TGC

Gly Ser Val Phe Leu Ile Ser Gln Leu
434 GGA TCT GTT TTC CTC ATC TCC CAG CTG

T

d

Phe Thr Phe Ser Pro Arg Arg His Glu
461 TTC ACC TTC TCG CCT CGC CGG CAT GAG

FIG. 2-2
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Thr Val Gin Asp Cys Asn Cys Ser He 

ACA GTA CAG GAC TGC AAC TGC TCA ATC 



Tyr Pro Gly His Val Ser Gly His Arg 
TAT CCC GGC CAC GTA TCA GGC CAT CGC 

T 
c 



Met Ala Trp Asp Met Met Met Asn Trp 
ATG GCT TGG GAT ATG ATG ATG AAC TGG 



Ser Pro Thr Ala 
TCG CCC ACG GCA 

FIG. 2-3 
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Asn Trp Ser Pro Thr 
jl 1 AAC TGG TCG CCC ACG 

discrepancy 
clone 

altered aa 



Ala Ala Leu Val Val Ser Gin Leu Leu 
jl 16 GCA GCC TTA GTG GTG TCG CAG TTA CTC 

i i i 

• • • 



Arg lie Pro Gin Ala Val Met Asp Met 
Jl 43 CGG ATC CCA CAA GCT GTC ATG GAC ATG 



Val Ala Gly Ala His Trp Gly Val Leu 
Jl 70 GTG GCG GGG GCC CAC TGG GGA GTC CTA 

G 
1 



Ala Gly Leu Ala Tyr Tyr Ser Met Val 

jl 97 GCG GGC CTT GCC TAC TAT TCC ATG GTG 

A 
1 



Gly Asn Trp Ala Lys Val Leu lie Val 
Jl 124 GGG AAC TGG GCT AAG GTT TTG ATT GTG 



Met Leu Leu Phe Ala Gly Val Asp Gly 
Jl 151 ATG CTA CTC TTT GCC GGC GTT GAC GGG 



His Thr Arg Val Thr Gly Gly Val Gin 
Jl 178 CAT ACC CGC GTG ACG GGG GGG GTG CAA 
AG A 
gg i 
Ser 

FIG. 3-1 
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Gly His Val Thr Ser Thr Leu Thr Ser 
Jl 205 GGC CAC GTC ACC TCT ACA CTC ACG TCC 

T G 
c i 
Ala 



Leu Phe Arg Pro Gly Ala Ser Gin Lys 
Jl 232 CTC TTT AGA CCT GGG GCG TCC CAG AAA 



lie Gin Leu Val Asn Thr Asn Gly Ser 
Jl 259 ATT CAG CTT GTA AAC ACC AAT GGC AGT 

TC T 

• • • 

11 1 
Ser Leu 



Trp His lie Asn Arg Thr Ala Leu Asn 
Jl 286 TGG CAT ATC AAC AGG ACT GCC CTG AAC 

T 

g 



Cys Asn Asp Ser Leu Gin Thr Gly Phe 
Jl 313 TGC AAT GAC TCC CTC CAA ACT GGG TTC 



Leu Ala Ala Leu 
Jl 340 CTT GCC GCG CTG 

FIG. 3-2 
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9/79 Ser 
Jl 1 C TCA 

discrepancy 
clone 

altered aa 



Val lie Asp Cys Asn Thr Cys Val Thr 
Jl 5 GTG ATC GAC TGT AAC ACA TGT GTC ACT 



Gin Thr Val Asp Phe Ser Leu Asp Pro 
Jl 32 CAG ACG GTC GAT TTC AGC TTG GAT CCC 



Thr Phe Thr lie Glu Thr Thr Thr Val 
Jl 59 ACC TTC ACC ATC GAG ACG ACG ACC GTG 

G 
c 

Ala 



Pro Gln Asp Ala Val Ser Arg Thr Gln
J1 86 CCC CAA GAT GCG GTT TCG CGC ACG CAG

Arg Arg Gly Arg Thr Gly Arg Gly Arg
J1 113 CGG CGA GGT AGG ACT GGC AGG GGC AGG

Arg Gly Ile Tyr Arg Phe Val Thr Pro
J1 140 AGA GGC ATC TAT AGG TTT GTG ACT CCA

Gly Glu Arg Pro Ser Ala Met Phe Asp
J1 167 GGA GAA CGG CCC TCG GCG ATG TTC GAT

Ser Ser Val Leu Cys Glu Cys Tyr Asp
J1 194 TCT TCG GTC CTA TGT GAG TGT TAT GAC

Ala Gly Cys Ala Trp Tyr Glu Leu Thr
J1 221 GCG GGC TGT GCT TGG TAT GAG CTC ACG

A

e

Gly(=)

FIG
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Pro Ala Glu Thr Ser Val Arg Leu Arg 
Jl 248 CCC GCT GAG ACC TCG GTT AGG TTG CGG 



Ala Tyr Leu Asn Thr Pro Gly Leu Pro 
Jl 275 GCT TAC CTA AAT ACA CCA GGG TTG CCC 



Val Cys Gin Asp His Leu Glu Phe Trp 
Jl 302 GTC TGC CAG GAC CAT CTG GAG TTC TGG 



Glu Ser Val Phe Thr Gly Leu Thr His 

Jl 329 GAG AGC GTC TTC ACA GGC CTC ACC CAC 



He Asp Ala His Phe Leu Ser Gin Thr 
Jl 356 ATA GAC GCC CAC TTC TTG TCC CAG ACT 



Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
Jl 383 AAG CAG GCA GGA GAC AAC TTC CCC TAC 



Leu Val Ala Tyr Gin Ala Thr Val Cys 
Jl 410 CTG GTA GCA TAC CAA GCC ACA GTG TGC 



Ala Arg Ala Lys Ala Pro Pro Pro Ser 
Jl 437 GCC AGG GCT AAG GCT CCA CCT CCA TCG 

C 
e 

Ala(=) 



Trp Asp Gin Met Trp Lys Cys Leu He 
Jl 464 TGG GAT CAA ATG TGG AAG TGT CTC ATA 



Arg Leu Lys Pro Thr Leu His Gly Pro 
Jl 491 CGG CTA AAG CCT ACG CTG CAC GGG CCA 

G 
e 
Ala 
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Thr Pro Leu Leu Tyr Arg Leu Gly Ala 
Jl 518 ACG CCC CTG CTG TAT AGG CTA GGA GCC 

A 
e 

Arg(=) 



Val Gln Asn Glu Val Thr Leu Thr His
J1 545 GTC CAG AAT GAG GTC ACC CTC ACA CAC



Pro lie Thr Lys 
Jl 572 CCT ATA ACC AAA 

FIG. 4-3 
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Leu Thr 

Jl 1 C CTC ACC 

discrepancy 

clone 

altered aa 



Arg Asp Pro Thr Val Pro Leu Ala Arg 
J1 8 CGT GAC CCC ACC GTC CCC CTT GCG CGG

Ala Ala Trp Glu Thr Ala Arg His Thr 
Jl 35 GCT GCG TGG GAG ACA GCT AGA CAC ACT 

C 

g 

Thr(=) 



Pro Val Asn Ser Trp Leu Gly Asn lie 
Jl 62 CCA GTC AAC TCC TGG CTA GGC AAC ATC 



lie Met Tyr Ala Pro Thr Leu Trp Ala 
Jl 89 ATC ATG TAT GCG CCC ACT TTG TGG GCA 

T 
9 

Ile(=) 



Arg Met lie Leu Met Thr His Phe Phe 
Jl 116 AGG ATG ATT CTG ATG ACT CAC TTC TTC 

Ser lie Leu Leu Ala Gin Glu Gin Leu 
Jl 143 TCC ATC CTT CTA GCC CAG GAG CAA CTT 



Glu Lys Ala Leu Asp Cys Gin He Tyr 
Jl 170 GAA AAA GCC CTG GAT TGT CAA ATC TAC 

Gly Ala Cys Tyr Ser He Glu Pro Leu 
J1 197 GGG GCC TGT TAC TCC ATT GAG CCA CTT

FIG. 5-1 
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Asp Leu Pro Gin lie He Glu Arg Leu 
Jl 224 GAC CTA CCT CAG ATC ATT GAA CGA CTC 



His Gly Leu Ser Ala Phe Ser Leu His 
Jl 251 CAT GGT CTT AGC GCA TTT TCA CTC CAT 



Ser Tyr Ser Pro Gly Glu He Asn Arg 
Jl 278 AGT TAC TCT CCA GGT GAG ATC AAT AGG 



Val Ala Ser Cys Leu Arg Lys Leu Gly 
Jl 305 GTG GCT TCA TGC CTC AGG AAG CTT GGG 



Val Pro Pro Leu Arg Val Trp Arg His 
Jl 332 GTA CCA CCC TTG CGA GTC TGG AGA CAT 



Arg Ala Arg Ser Val Arg Ala Lys Leu 
Jl 359 CGG GCC AGA AGT GTC CGC GCT AAG CTA 



Leu Ser Gin Gly Gly Arg Ala Ala Thr 
Jl 386 CTG TCC CAA GGG GGG AGG GCC GCC ACT 

G 

9 

Gln(=) 



Cys Gly Lys Tyr Leu
J1 413 TGT GGC AAG TAC CTC

FIG. 5-2 
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J7 1 
HCV1 



AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT 



J7 37 ACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGG
HCV1

Met Ser Thr Asn
J7 73 TCTCGTAGACCGTGCATC ATG AGC ACA AAT
HCV1 C G

Pro Lys Pro Gln Arg Lys Thr Lys Arg
J7 103 CCT AAA CCC CAA AGA AAA ACC AAA CGT
HCV1 T A A
Lys Asn
***

Asn Thr Asn Arg Arg Pro Gln Asp Val
J7 130 AAC ACC AAC CGT CGC CCA CAG GAC GTT
HCV1 C

Lys Phe Pro Gly Gly Gly Gln Ile Val
J7 157 AAG TTC CCG GGC GGT GGT CAG ATC GTC
HCV1 T C T

Gly Gly Val Tyr Leu Leu Pro Arg Arg
J7 184 GGT GGA GTT TAC TTG TTG CCG CGC AGG
HCV1



Gly Pro Arg Leu Gly Val Arg Ala Thr 
J7 211 GGC CCC AGG TTG GGT GTG CGT GCG ACT 
HCV1 T A C G 



Arg Lys Thr Ser Glu Arg Ser Gin Pro 
J7 238 AGG AAG ACT TCC GAG CGG TCG CAA CCT 
HCV1 A 



FIG. 6-1 
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Arg Gly Arg Arq Gin Pro lie Pro Lys 

J7 265 CGT GGA AGG CGA CAA CCT ATC CCC AAG 
HCV1 A T A T G 

Ala Arg Arg Pro Glu Gly Arg Thr Trp 

J7 292 GCT CGC CGG CCC GAG GGC AGG ACC TGG 
HCV1 T 



Ala Gin Pro Gly Tyr Pro Trp Pro Leu 
J7 319 GCT CAG CCT GGG TAT CCT TGG CCC CTC 
HCV1 C C 



Tyr Gly Asn Glu Gly Leu Gly Trp Ala 
J7 346 TAT GGC AAT GAG GGC TTG GGG TGG GCA 

HCV1 GC G 

Cys 
*** 



Gly Trp Leu Leu Ser Pro Arg Gly Ser 
J7 373 GGA TGG CTC CTG TCA CCC CGC GGC TCT 

T T 



HCV1 



Arg Pro Ser Trp Gly Pro Asn Asp Pro 
J7 400 CGG CCT AGT TGG GGC CCC AAT GAC CCC 
HCV1 C CA 

Thr 
*** 



Arg Arg Arg Ser Arg Asn Leu Gly Lys 
J7 427 CGG CGT AGG TCG CGT AAT TTG GGT AAG 
HCV1 c 

Val lie Asp Thr Leu Thr Cys Gly Phe 
J7 454 GTC ATC GAT ACC CTT ACA TGC GGC TTC 
HCV1 G 

FIG. 6-2 
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Ala Asp Leu Met Gly Tyr lie Pro Leu 
J7 481 GCC GAC CTC ATG GGG TAC ATT CCG CTT 
HCV1 A C 

Val Gly Ala Pro Leu Gly Gly Ala Ala 
J7 508 GTC GGC GCC CCC TTA GGG GGC GCT GCC 
HCV1 T C T A 



Arg Ala Leu Ala His Gly 
J7 535 AGG GCC CTG GCA CAT GGT 
HCV1 G C 



FIG. 6-3 
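
The HCV1 line under each J7 row in FIG. 6 lists only the nucleotides at which HCV1 departs from J7. As an illustrative sketch (the function name `differences` is hypothetical), the same listing can be derived from two aligned rows of equal length. The J7 row is row 211 of FIG. 6-1. The HCV1 row is transcribed from the corresponding stretch of the HCV-1 sequence given later in this document, which is an assumption about how the two sequences align.

```python
def differences(reference: str, other: str) -> list[tuple[int, str, str]]:
    """List (position, reference_base, other_base) wherever two aligned,
    equal-length sequences differ. Positions are 1-based within the row."""
    reference = reference.replace(" ", "").upper()
    other = other.replace(" ", "").upper()
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (index + 1, ref_base, other_base)
        for index, (ref_base, other_base) in enumerate(zip(reference, other))
        if ref_base != other_base
    ]


# J7 row 211 (FIG. 6-1) and the aligned HCV1 stretch (assumed alignment).
j7_row = "GGC CCC AGG TTG GGT GTG CGT GCG ACT"
hcv1_row = "GGC CCT AGA TTG GGT GTG CGC GCG ACG"

row_differences = differences(j7_row, hcv1_row)
for position, j7_base, hcv1_base in row_differences:
    print(f"offset {position}: J7 {j7_base} -> HCV1 {hcv1_base}")
```

The four HCV1 bases reported here (T, A, C, G) are the same four letters printed on the HCV1 line of that row in the figure; all four changes fall in third codon positions and leave the encoded amino acids unchanged.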
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Pro Leu Val Gly Ala Pro Leu Gly Gly 

J1 1 T CCG CTC GTC GGC GCC CCC TTA GGG GGC

A T C T A 

Ala Ala Arg Ala Leu Ala His Gly Val 

J1 29 GCT GCC AGG GCC CTG GCA CAT GGT GTC

G C 



Arg Val Leu Glu Asp Gly Val Asn Tyr 
jl 56 CGG GTT CTG GAG GAC GGC GTG AAC TAT 

A 



Ala Thr Gly Asn Leu Pro Gly Cys Ser 
Jl 83 GCA ACA GGG AAT TTG CCC GGT TGC TCT 

C C T T 



Phe Ser lie Phe Leu Leu Ala Leu Leu 
Jl 110 TTC TCT ATC TTC CTC TTG GCT CTG CTG 

T C C C 



Ser Cys Leu Thr He Pro Ala Ser Ala 
Jl 137 TCC TGT TTG ACC ATC CCA GCT TCC GCT 

T C TGGC GC 

Val 



Tyr Glu Val Arg Asn Val Ser Gly He 
Jl 164 TAT GAA GTG CGC AAC GTG TCC GGG ATA 

c c TCC AG C T 

Gin Ser Thr Leu 

*** *** 



Tyr His Val Thr Asn Asp Cys Ser Asn 
Jl 191 TAC CAT GTC ACA AAC GAC TGC TCC AAC 

C C T T C T 

Pro 
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Ser Ser lie Val Tyr Glu Ala Ala Asp 
Jl 218 TCA AGC ATT GTG TAT GAG GCG GCG GAC 

G T C C T 



Val lie Met His Ala Pro Gly Cys Val 
Jl 245 GTG ATC ATG CAT GCC CCC GGG TGC GTG 

CC C C A T G C 

Ala Leu Thr 



Pro Cys Val Arg Glu Asn Asn Ser Ser 
Jl 272 CCC TGC GTT CGG GAG AAC AAT TCC TCC 

T T GG C G G 

Gly Ala 
*** 



Arg Cys Trp Val Ala Leu Thr Pro Thr 
299 CGT TGC TGG GTA GCG CTC ACT CCC ACG 
AG T G AGCT 

Met 



Leu Ala Ala Arg Asn Ala Ser Val Pro
J1 326 CTC GCG GCC AGG AAT GCC AGC GTC CCC

G G C A G G AA C

Val Thr Asp Gly Lys Leu
*** ***

Thr Thr Thr Leu Arg Arg His Val Asp
J1 353 ACT ACG ACA TTA CGA CGC CAC GTC GAC

G G CAG C T T A T

Ala Gln Ile
***

Leu Leu Val Gly Thr Ala Ala Phe Cys
J1 380 TTG CTC GTT GGG ACG GCT GCT TTC TGC

C T C GC C A C C T

Ser Thr Leu

Ser Ala Met Tyr Val Gly Asp Leu Cys
J1 407 TCC GCT ATG TAC GTG GGG GAT CTC TGC

G C C C C A

Leu

Gly Ser Val Phe Leu Ile Ser Gln Leu
J1 434 GGA TCT GTT TTC CTC ATC TCC CAG CTG

G C T T G GG A

Val Gly

Phe Thr Phe Ser Pro Arg Arg His Glu
J1 461 TTC ACC TTC TCG CCT CGC CGG CAT GAG

T C A G C C TG

Trp
***



Thr Val Gin Asp Cys Asn Cys Ser He 
Jl 488 ACA GTA CAG GAC TGC AAC TGC TCA ATC 

G ACG A GT T T 

Thr Gly 
*** *** 



Tyr Pro Gly His Val Ser Gly His Arg 
Jl 515 TAT CCC GGC CAC GTA TCA GGC CAT CGC 

T A AG T C 
He Thr 



Met Ala Trp Asp Met Met Met Asn Trp 
Jl 542 ATG GCT TGG GAT ATG ATG ATG AAC TGG 

A 



Ser Pro Thr Ala 
Jl 569 TCG CCC ACG GCA 

C T AG 
Thr 
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19 


Ala 
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Vul 

GTG 
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V CI J 

GTG 
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Met 
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G T 
Ala 


Gin 
CAG 


Leu 
TTA 
C G 


Leu 
CTC 


Arg 
CGG 


Jl 
HCVl 


46 


He 
ATC 


Pro 
CCA 


Gin 
CAA 


Ala 
GCT 
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Val 
GTC 
A 

He 


Met 
ATG 
T 

Leu 


Asp 
GAC 


Met 
ATG 


Val 
GTG 
A C 
He 


Jl 
HCVl 


73 


Ala 
GCG 

T 


Gly 
GGG 
T 


Ala 
GCC 
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His 
CAC 


Trp 
TGG 


Gly 
GGA 


Val 
GTC 


Leu 
CTA 
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Ala 
GCG 


Jl 
HCVl 


Gly 

100 GGC 


Leu 
CTT 
A A 
He 


Ala 
GCC 
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Tyr 
TAC 
T 


Tyr 
TAT 
TC 
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TCC 


Met 
ATG 


Val 
GTG 


Gly 
GGG 


Jl 
HCVl 


127 


Asn 
AAC 


Trp 
TGG 


Ala 
GCT 
G 


Lys 
AAG 


Val 
GTT 
C 


Leu 
TTG 
C 


He Val 
ATT GTG 
G A 
Val 


Met 
ATG 
C 

Leu 


Jl 
HCVl 


154 


Leu 
CTA 
G 


Leu 
CTC 
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Phe 
TTT 


Ala 
GCC 


Gly 
GGC 


Val 
GTT 
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Asp 
GAC 
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GGG 

C 
Ala 
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CAT 
G A 
Glu 



*** 
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Val 
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Val 
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Phe 


Arg 
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Val 








*** 








*** 




*** 








Gin 


Leu 


Val 
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Gly 


Ser 


Trp 


Jl 


262 


CAG 


CTT 


GTA 


AAC 


ACC 


AAT 


GGC 


AGT 


TGG 


HCV1 
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A C 
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He 


















His 


He 
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Arg 


Thr 


Ala 


Leu 
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Jl 


289 


CAT 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC 


TGC 


HCV1 
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G 
















Leu 




Ser 






















*** 
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Asp 


Ser 
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Thr 


Gly 


Phe 


Leu 


Jl 


316 
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ACT 


GGG 
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CTT 


HCV1 






T 


AG 




A C 
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GG 


T G 














Asn 






Trp 





Ala Ala Leu 
Jl 343 GCC GCG CTG 

HCV1 AG T 

Gly 
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Jl 

HCVl 



Ser Val lie 
C TCA GTG ATC 
ggctataccggcgacttcga G A 



Asp Cys Asn Thr Cys Val Thr Gin Thr 
Jl 11 GAC TGT AAC ACA TGT GTC ACT CAG ACG 

HCVl C T G C A 



Val Asp Phe Ser Leu Asp Pro Thr Phe 
Jl 38 GTC GAT TTC AGC TTG GAT CCC ACC TTC 

HCVl C T C T 



Thr lie Glu Thr Thr Thr Val Pro Gin 

Jl 65 ACC ATC GAG ACG ACG ACC GTG CCC CAA 

HCVl T A TC G C C G 

lie 



Asp Ala Val Ser Arg Thr Gin Arg Arg 
Jl 92 GAT GCG GTT TCG CGC ACG CAG CGG CGA 

HCVl T C C T A T G 



Gly Arg Thr Gly Arg Gly Arg Arg Gly 
Jl 119 GGT AGG ACT GGC AGG GGC AGG AGA GGC 

HCVl C G A CC 

Lys Pro 



lie Tyr Arg Phe Val Thr Pro Gly Glu 

Jl 146 ATC TAT AGG TTT GTG ACT CCA GGA GAA 

HCVl C A G A G G G 

Ala 



Arg Pro Ser Ala Met Phe Asp Ser Ser 
Jl 173 CGG CCC TCG GCG ATG TTC GAT TCT TCG 

HCVl C C GC CGC 

Gly 
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Val Leu Cys Glu Cys Tyr Asp Ala Gly 
Jl 200 GTC CTA TGT GAG TGT TAT GAC GCG GGC 

HCV1 C CA 

Cys Ala Tip Tyr Glu Leu Thr Pro Ala 
Jl 227 TGT GCT TGG TAT GAG CTC ACG CCC GCT 

HCV1 C 

Glu Thr Ser Val Arg Leu Arg Ala Tyr 
Jl 254 GAG ACC TCG GTT AGG TTG CGG GCT TAC 

HCV1 T A A C A A G 

Thr 



Leu Asn Thr Pro Gly Leu Pro Val Cys 
Jl 281 CTA AAT ACA CCA GGG TTG CCC GTC TGC 

HCV1 AGCCG CT G 

Met 



Gin Asp His Leu Glu Phe Trp Glu Ser 
Jl 308 CAG GAC CAT CTG GAG TTC TGG GAG AGC 

TAT G 

Gly 



HCV1 



Val Phe Thr Gly Leu Thr His lie Asp 
Jl 335 GTC TTC ACA GGC CTC ACC CAC ATA GAC 

HCV1 T T T T 



Ala His Phe Leu Ser Gin Thr Lys Gin 
Jl 362 GCC CAC TTC TTG TCC CAG ACT AAG CAG 

HCV1 T C A A 



Ala Gly Asp Asn Phe Pro Tyr Leu Val 
Jl 389 GCA GGA GAC AAC TTC CCC TAC CTG GTA 

HCV1 AGT G G C T T 

_ Ser Glu Leu 
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Ala Tyr Gin Ala Thr Val Cys Ala Arg 
Jl 416 GCA TAC CAA GCC ACA GTG TGC GCC AGG 

HCV1 G C T 

Ala Lys Ala Pro Pro Pro Ser Trp Asp 
Jl 443 GCT AAG GCT CCA CCT CCA TCG TGG GAT 

HCV1 C A C T C C 

Gin 



Gin Met Trp Lys Cys Leu lie Arg Leu 
Jl 470 CAA ATG TGG AAG TGT CTC ATA CGG CTA 

HCV1 G T G T C C 

Lys Pro Thr Leu His Gly Pro Thr Pro 
Jl 497 AAG CCT ACG CTG CAC GGG CCA ACG CCC 

HCV1 C C C T A 

Leu Leu Tyr Arg Leu Gly Ala Val Gin 
Jl 524 CTG CTG TAT AGG CTA GGA GCC GTC CAG 

HCV1 A C A G C T T 



Asn Glu Val Thr Leu Thr His Pro lie 
Jl 551 AAT GAG GTC ACC CTC ACA CAC CCT ATA 

HCV1 A A G G A G C 

He Val 



Thr Lys 
Jl 578 ACC AAA 

HCV1 tacatcatgacatgcatgtc 

FIG. 9-3 






WO 91/04262 






Jl 1 
HCVl 



PCT/US90/05242



Leu Thr 
C CTC ACC 



Jl 
HCVl 



Jl 
HCVl 



Jl 
HCVl 



Jl 
HCVl 





Arg 


Asp 


Pro 


inr 


Val 


Pro 


Leu 


AT a 

AX CL 


ai y 


8 


CGT 


GAC 


CCC 


ACC 


GTC 


CCC 


CTT 


GCG 


CGG 








m 

T 


A 


AC 




C 
















Tnr 




















.t. -1. .t. 

XXX 












Ala 


Ala 


Trp 


Glu 


Thr 


Ala 


Arg 


His 


Thr 


35 


GCT 


GCG 


TGG 


GAG 


ACA 


GCT 
A 


AGA 


CAC 


ACT 




Pro 


Val 


Asn 


Ser 
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Thr 
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GCG 


CCC 


ACT 
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TGG 


GCA 
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A 
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G 



Phe 



Jl 
HCVl 



Arg Met He Leu Met Thr His Phe Phe 
116 AGG ATG ATT CTG ATG ACT CAC TTC TTC 

A CTT 



Jl 
HCVl 



Ser He Leu Leu Ala Gin Glu Gin Leu 
143 TCC ATC CTT CTA GCC CAG GAG CAA CTT 
AG G A AG C G 

Val He Arg Asp 

*** 



Jl 
HCVl 



Glu Lys Ala Leu Asp Cys Gin He Tyr 

170 GAA AAA GCC CTG GAT TGT CAA ATC TAC 

C G C CGG 

Gin Glu 

*** *** 
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Gly Ala Cys ^Tyr Ser lie Glu Pro Leu 

Jl 197 GGG GCC TGT TAC TCC ATT GAG CCA CTT 

HCV1 C A A 



Asp Leu Pro Gin lie lie Glu Arg Leu 
Jl 224 GAC CTA CCT CAG ATC ATT GAA CGA CTC 
HCV1 T CA C A 

Pro Gin 
*** *** 



His Gly Leu Ser Ala Phe Ser Leu His 
Jl 251 CAT GGT CTT AGC GCA TTT TCA CTC CAT 
HCV1 C C C 



Ser Tyr Ser Pro Gly Glu lie Asn Arg 
Jl 278 AGT TAC TCT CCA GGT GAG ATC AAT AGG 
HCV1 A T 



Val Ala Ser Cys Leu Arg Lys Leu Gly 
Jl 305 GTG GCT TCA TGC CTC AGG AAG CTT GGG 
HCV1 C G A A 

Ala 



Val Pro Pro Leu Arg Val Trp Arg His 
Jl 332 GTA CCA CCC TTG CGA GTC TGG AGA CAT 
HCV1 G CT C 

Ala 
*** 



Arg Ala Arg Ser Val Arg Ala Lys Leu 
Jl 359 CGG GCC AGA AGT GTC CGC GCT AAG CTA 
HCV1 CGC G T 

Arg 



Leu Ser Gin Gly Gly Arg Ala Ala Thr 

Jl 386 CTG TCC CAA GGG GGG AGG GCC GCC ACT 

HCV1 G AG A C T TA 

Ala Arg lie 

*** *** 
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HCV-l 



-267 GCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGG 
CGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCC 



-223 ACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGA 
TGGGGGGGAGGGCCCTCTCGGTATCACCAGACGCCTTGGCCACT 



-179 GTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAA 
CATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCTAGTT 



-135 CCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGC 
GGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACG 



-91 TAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT
ATCGGCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGA 



-47 GATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGC
CTATCCCACGAACGCTCACGGGGCCCTCCAGAGCATCTGGCACG 



-3 ACC -1 
TGG 



Met Ser Thr Asn Pro Lys Pro Gln Lys Lys Asn
1 ATG AGC ACG AAT CCT AAA CCT CAA AAA AAA AAC
TAC TCG TGC TTA GGA TTT GGA GTT TTT TTT TTG



Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp Val
34 AAA CGT AAC ACC AAC CGT CGC CCA CAG GAC GTC
TTT GCA TTG TGG TTG GCA GCG GGT GTC CTG CAG



Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly
67 AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT GGA
TTC AAG GGC CCA CCG CCA GTC TAG CAA CCA CCT
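
Each row of the HCV-1 listing prints the coding strand with its base-paired complement directly beneath it, base for base and without reversal. A minimal sketch of that relationship, with the hypothetical helper name `complement`, is given below. It can be used to check a transcribed row against the strand printed under it.

```python
# Base-pairing map for DNA; characters other than A, C, G, T (such as the
# spaces between codons) pass through unchanged.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand: str) -> str:
    """Return the base-paired strand, position by position (not reversed)."""
    return strand.upper().translate(_COMPLEMENT)


# First coding row of the HCV-1 listing (nucleotides 1-33) and the strand
# printed beneath it.
coding_row = "ATG AGC ACG AAT CCT AAA CCT CAA AAA AAA AAC"
printed_lower_row = "TAC TCG TGC TTA GGA TTT GGA GTT TTT TTT TTG"

print(complement(coding_row) == printed_lower_row)
```

A row whose two strands fail this check has most likely been mis-transcribed in one of them, and the amino acid line above the row indicates which strand to trust.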
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Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu 
GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG 
CAA ATG AAC AAC GGC GCG TCC CCG GGA TCT AAC 



Gly Val Arg Ala Thr Arg Lys Thr Ser Glu Arg 
GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG CGG 
CCA CAC GCG CGC TGC TCT TTC TGA AGG CTC GCC 



Ser Gin Pro Arg Gly Arg Arg Gin Pro lie Pro 
TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC 
AGC GTT GGA GCT CCA TCT GCA GTC GGA TAG GGG 



Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala 
AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT 
TTC CGA GCA GCC GGG CTC CCG TCC TGG ACC CGA 



Gin Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn 
CAG CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT 
GTC GGG CCC ATG GGA ACC GGG GAG ATA CCG TTA 



Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser 
GAG GGC TGC GGG TGG GCG GGA TGG CTC CTG TCT 
CTC CCG ACG CCC ACC CGC CCT ACC GAG GAC AGA 



Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA 
GGG GCA CCG AGA GCC GGA TCG ACC CCG GGG TGT 



Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys 
GAC CCC CGG CGT AGG TCG CGC AAT TTG GGT AAG 
CTG GGG GCC GCA TCC AGC GCG TTA AAC CCA TTC 



Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp 
GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC 
CAG TAG CTA TGG GAA TGC ACG CCG AAG CGG CTG 
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Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro 

397 CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT 

GAG TAC CCC ATG TAT GGC GAG CAG CCG CGG GGA 



Leu Gly Gly Ala Ala Arg Ala Leu Ala His Gly 
430 CTT GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC 
GAA CCT CCG CGA CGG TCC CGG GAC CGC GTA CCG 



Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
463 GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 
CAG GCC CAA GAC CTT CTG CCG CAC TTG ATA CGT 



Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
496 ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 
TGT CCC TTG GAA GGA CCA ACG AGA AAG AGA TAG 



Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val 
529 TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG 
AAG GAA GAC CGG GAC GAG AGA ACG AAC TGA CAC 



Pro Ala Ser Ala Tyr Gin Val Arg Asn Ser Thr 
562 CCC GCT TCG GCC TAC CAA GTG CGC AAC TCC ACG 
GGG CGA AGC CGG ATG GTT CAC GCG TTG AGG TGC 



Gly Leu Tyr His Val Thr Asn Asp Cys Pro Asn 
595 GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT AAC 
CCC GAA ATG GTG CAG TGG TTA CTA ACG GGA TTG 



Ser Ser He Val Tyr Glu Ala Ala Asp Ala He 
628 TCG AGT ATT GTG TAC GAG GCG GCC GAT GCC ATC 
AGC TCA TAA CAC ATG CTC CGC CGG CTA CGG TAG 



Leu His Thr Pro Gly Cys Val Pro Cys Val Arg 

661 CTG CAC ACT CCG GGG TGC GTC CCT TGC GTT CGT 
GAC GTG TGA GGC CCC ACG CAG GGA ACG CAA GCA 
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Glu Gly Asn Ala Ser Arg Cys. Trp Val Ala Met 
GAG GGC AAC GCC TCG AGG TGT TGG GTG GCG ATG 
CTC CCG TTG CGG AGC TCC ACA ACC CAC CGC TAC 



Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu 
ACC CCT ACG GTG GCC ACC AGG GAT GGC AAA CTC 
TGG GGA TGC CAC CGG TGG TCC CTA CCG TTT GAG 



Pro Ala Thr Gin Leu Arg Arg His lie Asp Leu 
CCC GCG ACG CAG CTT CGA CGT CAC ATC GAT CTG 
GGG CGC TGC GTC GAA GCT GCA GTG TAG CTA GAC 



Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu 
CTT GTC GGG AGC GCC ACC CTC TGT TCG GCC CTC 
GAA CAG CCC TCG CGG TGG GAG ACA AGC CGG GAG 



Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu 
TAC GTG GGG GAC CTA TGC GGG TCT GTC TTT CTT 
ATG CAC CCC CTG GAT ACG CCC AGA CAG AAA GAA 



Val Gly Gin Leu Phe Thr Phe Ser Pro Arg Arg 
GTC GGC CAA CTG TTC ACC TTC TCT CCC AGG CGC 
CAG CCG GTT GAC AAG TGG AAG AGA GGG TCC GCG 



His Trp Thr Thr Gin Gly Cys Asn Cys Ser lie 
CAC TGG ACG ACG CAA GGT TGC AAT TGC TCT ATC 
GTG ACC TGC TGC GTT CCA ACG TTA ACG AGA TAG 



Tyr Pro Gly His lie Thr Gly His Arg Met Ala 
TAT CCC GGC CAT ATA ACG GGT CAC CGC ATG GCA 
ATA GGG CCG GTA TAT TGC CCA GTG GCG TAC CGT 



Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr 
TGG GAT ATG ATG ATG AAC TGG TCC CCT ACG ACG 
ACC CTA TAC TAC TAC TTG ACC AGG GGA TGC TGC 
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Ala Leu Val Met Ala Gin Leu Leu Arg lie Pro 
GCG TTG GTA ATG GCT CAG CTG CTC CGG ATC CCA 
CGC AAC CAT TAC CGA GTC GAC GAG GCC TAG GGT 



Gin Ala lie Leu Asp Met lie Ala Gly Ala His 
CAA GCC ATC TTG GAC ATG ATC GCT GGT GCT CAC 
GTT CGG TAG AAC CTG TAC TAG CGA CCA CGA GTG 



Trp Gly Val Leu Ala Gly lie Ala Tyr Phe Ser 
TGG GGA GTC CTG GCG GGC ATA GCG TAT TTC TCC 
ACC CCT CAG GAC CGC CCG TAT CGC ATA AAG AGG 



Met Val Gly Asn Trp Ala Lys Val Leu Val Val 
ATG GTG GGG AAC TGG GCG AAG GTC CTG GTA GTG 
TAC CAC CCC TTG ACC CGC TTC CAG GAC CAT CAC 



Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr 
CTG CTG CTA TTT GCC GGC GTC GAC GCG GAA ACC 
GAC GAC GAT AAA CGG CCG CAG CTG CGC CTT TGG 



His Val Thr Gly Gly Ser Ala Gly His Thr Val 
CAC GTC ACC GGG GGA AGT GCC GGC CAC ACT GTG 
GTG CAG TGG CCC CCT TCA CGG CCG GTG TGA CAC 



Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala 
TCT GGA TTT GTT AGC CTC CTC GCA CCA GGC GCC 
AGA CCT AAA CAA TCG GAG GAG CGT GGT CCG CGG 



Lys Gin Asn Val Gin Leu lie Asn Thr Asn Gly 

AAG CAG AAC GTC CAG CTG ATC AAC ACC AAC GGC 
TTC GTC TTG CAG GTC GAC TAG TTG TGG TTG CCG 



Ser Trp His Leu Asn Ser Thr Ala Leu Asn Cys
AGT TGG CAC CTC AAT AGC ACG GCC CTG AAC TGC
TCA ACC GTG GAG TTA TCG TGC CGG GAC TTG ACG

Asn Asp Ser Leu Asn Thr Gly Trp Leu Ala Gly
AAT GAT AGC CTC AAC ACC GGC TGG TTG GCA GGG
TTA CTA TCG GAG TTG TGG CCG ACC AAC CGT CCC



Leu Phe Tyr His His Lys Phe Asn Ser Ser Gly
CTT TTC TAT CAC CAC AAG TTC AAC TCT TCA GGC
GAA AAG ATA GTG GTG TTC AAG TTG AGA AGT CCG



Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu
TGT CCT GAG AGG CTA GCC AGC TGC CGA CCC CTT
ACA GGA CTC TCC GAT CGG TCG ACG GCT GGG GAA



Thr Asp Phe Asp Gin Gly Trp Gly Pro He Ser 
ACC GAT TTT GAC CAG GGC TGG GGC CCT ATC AGT 
TGG CTA AAA CTG GTC CCG ACC CCG GGA TAG TCA 



Tyr Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro 
TAT GCC AAC GGA AGC GGC CCC GAC CAG CGC CCC 
ATA CGG TTG CCT TCG CCG GGG CTG GTC GCG GGG 



Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly 
TAC TGC TGG CAC TAC CCC CCA AAA CCT TGC GGT 
ATG ACG ACC GTG ATG GGG GGT TTT GGA ACG CCA 



He Val Pro Ala Lys Ser Val Cys Gly Pro Val 
ATT GTG CCC GCG AAG AGT GTG TGT GGT CCG GTA 
TAA CAC GGG CGC TTC TCA CAC ACA CCA GGC CAT 



Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
TAT TGC TTC ACT CCC AGC CCC GTG GTG GTG GGA 
ATA ACG AAG TGA GGG TCG GGG CAC CAC CAC CCT 



Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser 
ACG ACC GAC AGG TCG GGC GCG CCC ACC TAC AGC 
TGC TGG CTG TCC AGC CCG CGC GGG TGG ATG TCG



Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu
TGG GGT GAA AAT GAT ACG GAC GTC TTC GTC CTT 
ACC CCA CTT TTA CTA TGC CTG CAG AAG CAG GAA 



Asn Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe 

AAC AAT ACC AGG CCA CCG CTG GGC AAT TGG TTC 
TTG TTA TGG TCC GGT GGC GAC CCG TTA ACC AAG 



Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr 
GGT TGT ACC TGG ATG AAC TCA ACT GGA TTC ACC 
CCA ACA TGG ACC TAC TTG AGT TGA CCT AAG TGG 



Lys Val Cys Gly Ala Pro Pro Cys Val Ile Gly
AAA GTG TGC GGA GCG CCT CCT TGT GTC ATC GGA 
TTT CAC ACG CCT CGC GGA GGA ACA CAG TAG CCT 



Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr 
GGG GCG GGC AAC AAC ACC CTG CAC TGC CCC ACT 
CCC CGC CCG TTG TTG TGG GAC GTG ACG GGG TGA 



Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr
GAT TGC TTC CGC AAG CAT CCG GAC GCC ACA TAC 
CTA ACG AAG GCG TTC GTA GGC CTG CGG TGT ATG 



Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr Pro
TCT CGG TGC GGC TCC GGT CCC TGG ATC ACA CCC 
AGA GCC ACG CCG AGG CCA GGG ACC TAG TGT GGG 



Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp 
AGG TGC CTG GTC GAC TAC CCG TAT AGG CTT TGG 
TCC ACG GAC CAG CTG ATG GGC ATA TCC GAA ACC 



His Tyr Pro Cys Thr Ile Asn Tyr Thr Ile Phe
CAT TAT CCT TGT ACC ATC AAC TAC ACC ATA TTT 
GTA ATA GGA ACA TGG TAG TTG ATG TGG TAT AAA



Lys Ile Arg Met Tyr Val Gly Gly Val Glu His
AAA ATC AGG ATG TAC GTG GGA GGG GTC GAA CAC 
TTT TAG TCC TAC ATG CAC CCT CCC CAG CTT GTG 



Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly 
AGG CTG GAA GCT GCC TGC AAC TGG ACG CGG GGC 
TCC GAC CTT CGA CGG ACG TTG ACC TGC GCC CCG 



Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser 
GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG TCC 
CTT GCA ACG CTA GAC CTT CTG TCC CTG TCC AGG 



Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr Gln
GAG CTC AGC CCG TTA CTG CTG ACC ACT ACA CAG 
CTC GAG TCG GGC AAT GAC GAC TGG TGA TGT GTC 



Trp Gln Val Leu Pro Cys Ser Phe Thr Thr Leu
TGG CAG GTC CTC CCG TGT TCC TTC ACA ACC CTA 
ACC GTC CAG GAG GGC ACA AGG AAG TGT TGG GAT 



Pro Ala Leu Ser Thr Gly Leu Ile His Leu His
CCA GCC TTG TCC ACC GGC CTC ATC CAC CTC CAC 
GGT CGG AAC AGG TGG CCG GAG TAG GTG GAG GTG 



Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
CAG AAC ATT GTG GAC GTG CAG TAC TTG TAC GGG 
GTC TTG TAA CAC CTG CAC GTC ATG AAC ATG CCC 



Val Gly Ser Ser Ile Ala Ser Trp Ala Ile Lys
GTG GGG TCA AGC ATC GCG TCC TGG GCC ATT AAG 
CAC CCC AGT TCG TAG CGC AGG ACC CGG TAA TTC 



Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu 
TGG GAG TAC GTC GTT CTC CTG TTC CTT CTG CTT 
ACC CTC ATG CAG CAA GAG GAC AAG GAA GAC GAA



Ala Asp Ala Arg Val Cys Ser Cys Leu Trp Met
GCA GAC GCG CGC GTC TGC TCC TGC TTG TGG ATG 
CGT CTG CGC GCG CAG ACG AGG ACG AAC ACC TAC 



Met Leu Leu Ile Ser Gln Ala Glu Ala Ala Leu
ATG CTA CTC ATA TCC CAA GCG GAG GCG GCT TTG 
TAC GAT GAG TAT AGG GTT CGC CTC CGC CGA AAC 



Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu
GAG AAC CTC GTA ATA CTT AAT GCA GCA TCC CTG 
CTC TTG GAG CAT TAT GAA TTA CGT CGT AGG GAC 



Ala Gly Thr His Gly Leu Val Ser Phe Leu Val 
GCC GGG ACG CAC GGT CTT GTA TCC TTC CTC GTG 
CGG CCC TGC GTG CCA GAA CAT AGG AAG GAG CAC 



Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly Lys 
TTC TTC TGC TTT GCA TGG TAT TTG AAG GGT AAG 
AAG AAG ACG AAA CGT ACC ATA AAC TTC CCA TTC 



Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly 
TGG GTG CCC GGA GCG GTC TAC ACC TTC TAC GGG 
ACC CAC GGG CCT CGC CAG ATG TGG AAG ATG CCC 



Met Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu 
ATG TGG CCT CTC CTC CTG CTC CTG TTG GCG TTG 
TAC ACC GGA GAG GAG GAC GAG GAC AAC CGC AAC 



Pro Gln Arg Ala Tyr Ala Leu Asp Thr Glu Val
CCC CAG CGG GCG TAC GCG CTG GAC ACG GAG GTG 
GGG GTC GCC CGC ATG CGC GAC CTG TGC CTC CAC 



Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly 
GCC GCG TCG TGT GGC GGT GTT GTT CTC GTC GGG 
CGG CGC AGC ACA CCG CCA CAA CAA GAG CAG CCC



Leu Met Ala Leu Thr Leu Ser Pro Tyr Tyr Lys
2476 TTG ATG GCG CTG ACT CTG TCA CCA TAT TAC AAG 
AAC TAC CGC GAC TGA GAC AGT GGT ATA ATG TTC 



Arg Tyr Ile Ser Trp Cys Leu Trp Trp Leu Gln
2509 CGC TAT ATC AGC TGG TGC TTG TGG TGG CTT CAG
GCG ATA TAG TCG ACC ACG AAC ACC ACC GAA GTC



Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His
2542 TAT TTT CTG ACC AGA GTG GAA GCG CAA CTG CAC 
ATA AAA GAC TGG TCT CAC CTT CGC GTT GAC GTG 



Val Trp Ile Pro Pro Leu Asn Val Arg Gly Gly
2575 GTG TGG ATT CCC CCC CTC AAC GTC CGA GGG GGG 
CAC ACC TAA GGG GGG GAG TTG CAG GCT CCC CCC 



Arg Asp Ala Val Ile Leu Leu Met Cys Ala Val
2608 CGC GAC GCC GTC ATC TTA CTC ATG TGT GCT GTA 
GCG CTG CGG CAG TAG AAT GAG TAC ACA CGA CAT 



His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu
2641 CAC CCG ACT CTG GTA TTT GAC ATC ACC AAA TTG 
GTG GGC TGA GAC CAT AAA CTG TAG TGG TTT AAC 



Leu Leu Ala Val Phe Gly Pro Leu Trp Ile Leu
2674 CTG CTG GCC GTC TTC GGA CCC CTT TGG ATT CTT
GAC GAC CGG CAG AAG CCT GGG GAA ACC TAA GAA



Gln Ala Ser Leu Leu Lys Val Pro Tyr Phe Val
2707 CAA GCC AGT TTG CTT AAA GTA CCC TAC TTT GTG 
GTT CGG TCA AAC GAA TTT CAT GGG ATG AAA CAC 



Arg Val Gln Gly Leu Leu Arg Phe Cys Ala Leu
2740 CGC GTC CAA GGC CTT CTC CGG TTC TGC GCG TTA
GCG CAG GTT CCG GAA GAG GCC AAG ACG CGC AAT

FIG. 12-10




Ala Arg Lys Met Ile Gly Gly His Tyr Val Gln
2773 GCG CGG AAG ATG ATC GGA GGC CAT TAC GTG CAA 
CGC GCC TTC TAC TAG CCT CCG GTA ATG CAC GTT 



Met Val Ile Ile Lys Leu Gly Ala Leu Thr Gly
2806 ATG GTC ATC ATT AAG TTA GGG GCG CTT ACT GGC 
TAC CAG TAG TAA TTC AAT CCC CGC GAA TGA CCG 



Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg
2839 ACC TAT GTT TAT AAC CAT CTC ACT CCT CTT CGG
TGG ATA CAA ATA TTG GTA GAG TGA GGA GAA GCC



Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala
2872 GAC TGG GCG CAC AAC GGC TTG CGA GAT CTG GCC
CTG ACC CGC GTG TTG CCG AAC GCT CTA GAC CGG



Val Ala Val Glu Pro Val Val Phe Ser Gln Met
2905 GTG GCT GTA GAG CCA GTC GTC TTC TCC CAA ATG
CAC CGA CAT CTC GGT CAG CAG AAG AGG GTT TAC



Glu Thr Lys Leu Ile Thr Trp Gly Ala Asp Thr
2938 GAG ACC AAG CTC ATC ACG TGG GGG GCA GAT ACC 
CTC TGG TTC GAG TAG TGC ACC CCC CGT CTA TGG 



Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro
2971 GCC GCG TGC GGT GAC ATC ATC AAC GGC TTG CCT 
CGG CGC ACG CCA CTG TAG TAG TTG CCG AAC GGA 



Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu
3004 GTT TCC GCC CGC AGG GGC CGG GAG ATA CTG CTC 
CAA AGG CGG GCG TCC CCG GCC CTC TAT GAC GAG 



Gly Pro Ala Asp Gly Met Val Ser Lys Gly Trp 
3037 GGG CCA GCC GAT GGA ATG GTC TCC AAG GGG TGG 
CCC GGT CGG CTA CCT TAC CAG AGG TTC CCC ACC



Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln
3070 AGG TTG CTG GCG CCC ATC ACG GCG TAC GCC CAG 
TCC AAC GAC CGC GGG TAG TGC CGC ATG CGG GTC 



Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr
3103 CAG ACA AGG GGC CTC CTA GGG TGC ATA ATC ACC 
GTC TGT TCC CCG GAG GAT CCC ACG TAT TAG TGG 



Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu
3136 AGC CTA ACT GGC CGG GAC AAA AAC CAA GTG GAG 
TCG GAT TGA CCG GCC CTG TTT TTG GTT CAC CTC 



Gly Glu Val Gln Ile Val Ser Thr Ala Ala Gln
3169 GGT GAG GTC CAG ATT GTG TCA ACT GCT GCC CAA 
CCA CTC CAG GTC TAA CAC AGT TGA CGA CGG GTT 



Thr Phe Leu Ala Thr Cys Ile Asn Gly Val Cys
3202 ACC TTC CTG GCA ACG TGC ATC AAT GGG GTG TGC 
TGG AAG GAC CGT TGC ACG TAG TTA CCC CAC ACG 



Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 
3235 TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 
ACC TGA CAG ATG GTG CCC CGG CCT TGC TCC TGG 



Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met
3268 ATC GCG TCA CCC AAG GGT CCT GTC ATC CAG ATG 
TAG CGC AGT GGG TTC CCA GGA CAG TAG GTC TAC 



Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp
3301 TAT ACC AAT GTA GAC CAA GAC CTT GTG GGC TGG 
ATA TGG TTA CAT CTG GTT CTG GAA CAC CCG ACC 



Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr Pro
3334 CCC GCT CCG CAA GGT AGC CGC TCA TTG ACA CCC 
GGG CGA GGC GTT CCA TCG GCG AGT AAC TGT GGG



Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val
TGC ACT TGC GGC TCC TCG GAC CTT TAC CTG GTC 
ACG TGA ACG CCG AGG AGC CTG GAA ATG GAC CAG 



Thr Arg His Ala Asp Val Ile Pro Val Arg Arg
ACG AGG CAC GCC GAT GTC ATT CCC GTG CGC CGG 
TGC TCC GTG CGG CTA CAG TAA GGG CAC GCG GCC 



Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro 
CGG GGT GAT AGC AGG GGC AGC CTG CTG TCG CCC 
GCC CCA CTA TCG TCC CCG TCG GAC GAC AGC GGG 



Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly
CGG CCC ATT TCC TAC TTG AAA GGC TCC TCG GGG 
GCC GGG TAA AGG ATG AAC TTT CCG AGG AGC CCC 



Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val 
GGT CCG CTG TTG TGC CCC GCG GGG CAC GCC GTG 
CCA GGC GAC AAC ACG GGG CGC CCC GTG CGG CAC 



Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly
GGC ATA TTT AGG GCC GCG GTG TGC ACC CGT GGA 
CCG TAT AAA TCC CGG CGC CAC ACG TGG GCA CCT 



Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu
GTG GCT AAG GCG GTG GAC TTT ATC CCT GTG GAG 
CAC CGA TTC CGC CAC CTG AAA TAG GGA CAC CTC 



Asn Leu Glu Thr Thr Met Arg Ser Pro Val Phe 
AAC CTA GAG ACA ACC ATG AGG TCC CCG GTG TTC 
TTG GAT CTC TGT TGG TAC TCC AGG GGC CAC AAG 



Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln
ACG GAT AAC TCC TCT CCA CCA GTA GTG CCC CAG 
TGC CTA TTG AGG AGA GGT GGT CAT CAC GGG GTC



Ser Phe Gln Val Ala His Leu His Ala Pro Thr
AGC TTC CAG GTG GCT CAC CTC CAT GCT CCC ACA 
TCG AAG GTC CAC CGA GTG GAG GTA CGA GGG TGT 



Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala 
GGC AGC GGC AAA AGC ACC AAG GTC CCG GCT GCA 
CCG TCG CCG TTT TCG TGG TTC CAG GGC CGA CGT 



Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu
TAT GCA GCT CAG GGC TAT AAG GTG CTA GTA CTC 
ATA CGT CGA GTC CCG ATA TTC CAC GAT CAT GAG 



Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
AAC CCC TCT GTT GCT GCA ACA CTG GGC TTT GGT 
TTG GGG AGA CAA CGA CGT TGT GAC CCG AAA CCA 



Ala Tyr Met Ser Lys Ala His Gly Ile Asp Pro
GCT TAC ATG TCC AAG GCT CAT GGG ATC GAT CCT 
CGA ATG TAC AGG TTC CGA GTA CCC TAG CTA GGA 



Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr
AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT 
TTG TAG TCC TGG CCC CAC TCT TGT TAA TGG TGA 



Gly Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys
GGC AGC CCC ATC ACG TAC TCC ACC TAC GGC AAG 
CCG TCG GGG TAG TGC ATG AGG TGG ATG CCG TTC 



Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala 

TTC CTT GCC GAC GGC GGG TGC TCG GGG GGC GCT 
AAG GAA CGG CTG CCG CCC ACG AGC CCC CCG CGA 



Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser
TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC 
ATA CTG TAT TAT TAA ACA CTG CTC ACG GTG AGG



Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr
3961 ACG GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT 
TGC CTA CGG TGT AGG TAG AAC CCG TAG CCG TGA 



Val Leu Asp Gln Ala Glu Thr Ala Gly Ala Arg
3994 GTC CTT GAC CAA GCA GAG ACT GCG GGG GCG AGA 
CAG GAA CTG GTT CGT CTC TGA CGC CCC CGC TCT 



Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly 
4027 CTG GTT GTG CTC GCC ACC GCC ACC CCT CCG GGC 
GAC CAA CAC GAG CGG TGG CGG TGG GGA GGC CCG 



Ser Val Thr Val Pro His Pro Asn Ile Glu Glu
4060 TCC GTC ACT GTG CCC CAT CCC AAC ATC GAG GAG 
AGG CAG TGA CAC GGG GTA GGG TTG TAG CTC CTC 



Val Ala Leu Ser Thr Thr Gly Glu Ile Pro Phe
4093 GTT GCT CTG TCC ACC ACC GGA GAG ATC CCT TTT 
CAA CGA GAC AGG TGG TGG CCT CTC TAG GGA AAA 



Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys
4126 TAC GGC AAG GCT ATC CCC CTC GAA GTA ATC AAG 
ATG CCG TTC CGA TAG GGG GAG CTT CAT TAG TTC 



Gly Gly Arg His Leu Ile Phe Cys His Ser Lys
4159 GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG 
CCC CCC TCT GTA GAG TAG AAG ACA GTA AGT TTC 



Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Val 
4192 AAG AAG TGC GAC GAA CTC GCC GCA AAG CTG GTC
TTC TTC ACG CTG CTT GAG CGG CGT TTC GAC CAG 



Ala Leu Gly Ile Asn Ala Val Ala Tyr Tyr Arg
4225 GCA TTG GGC ATC AAT GCC GTG GCC TAC TAC CGC 
CGT AAC CCG TAG TTA CGG CAC CGG ATG ATG GCG



Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly
4258 GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC 
CCA GAA CTG CAC AGG CAG TAG GGC TGG TCG CCG 



Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
4291 GAT GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG 
CTA CAA CAG CAG CAC CGT TGG CTA CGG GAG TAC 



Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile
4324 ACC GGC TAT ACC GGC GAC TTC GAC TCG GTG ATA 
TGG CCG ATA TGG CCG CTG AAG CTG AGC CAC TAT 



Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
4357 GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT 
CTG ACG TTA TGC ACA CAG TGG GTC TGT CAG CTA 



Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr
4390 TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA 
AAG TCG GAA CTG GGA TGG AAG TGG TAA CTC TGT 



Ile Thr Leu Pro Gln Asp Ala Val Ser Arg Thr
4423 ATC ACG CTC CCC CAG GAT GCT GTC TCC CGC ACT 
TAG TGC GAG GGG GTC CTA CGA CAG AGG GCG TGA 



Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro
4456 CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG CCA 
GTT GCA GCC CCG TCC TGA CCG TCC CCC TTC GGT 



Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg
4489 GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC 
CCG TAG ATG TCT AAA CAC CGT GGC CCC CTC GCG 



Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys 
4522 CCC TCC GGC ATG TTC GAC TCG TCC GTC CTC TGT 
GGG AGG CCG TAC AAG CTG AGC AGG CAG GAG ACA



Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu
GAG TGC TAT GAC GCA GGC TGT GCT TGG TAT GAG 
CTC ACG ATA CTG CGT CCG ACA CGA ACC ATA CTC 



Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg 
CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA 
GAG TGC GGG CGG CTC TGA TGT CAA TCC GAT GCT 



Ala Tyr Met Asn Thr Pro Gly Leu Pro Val Cys 
GCG TAC ATG AAC ACC CCG GGG CTT CCC GTG TGC 
CGC ATG TAC TTG TGG GGC CCC GAA GGG CAC ACG 



Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe
CAG GAC CAT CTT GAA TTT TGG GAG GGC GTC TTT 
GTC CTG GTA GAA CTT AAA ACC CTC CCG CAG AAA 



Thr Gly Leu Thr His Ile Asp Ala His Phe Leu
ACA GGC CTC ACT CAT ATA GAT GCC CAC TTT CTA 
TGT CCG GAG TGA GTA TAT CTA CGG GTG AAA GAT 



Ser Gln Thr Lys Gln Ser Gly Glu Asn Leu Pro
TCC CAG ACA AAG CAG AGT GGG GAG AAC CTT CCT 
AGG GTC TGT TTC GTC TGA CCC CTC TTG GAA GGA 



Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala
TAC CTG GTA GCG TAC CAA GCC ACC GTG TGC GCT 
ATG GAC CAT CGC ATG GTT CGG TGG CAC ACG CGA 



Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln
AGG GCT CAA GCC CCT CCC CCA TCG TGG GAC CAG 
TCC CGA GTT CGG GGA GGG GGT AGC ACC CTG GTC 



Met Trp Lys Cys Leu Ile Arg Leu Lys Pro Thr
ATG TGG AAG TGT TTG ATT CGC CTC AAG CCC ACC 
TAC ACC TTC ACA AAC TAA GCG GAG TTC GGG TGG



Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu
4852 CTC CAT GGG CCA ACA CCC CTG CTA TAC AGA CTG 
GAG GTA CCC GGT TGT GGG GAC GAT ATG TCT GAC 



Gly Ala Val Gln Asn Glu Ile Thr Leu Thr His
4885 GGC GCT GTT CAG AAT GAA ATC ACC CTG ACG CAC 
CCG CGA CAA GTC TTA CTT TAG TGG GAC TGC GTG 



Pro Val Thr Lys Tyr Ile Met Thr Cys Met Ser
4918 CCA GTC ACC AAA TAC ATC ATG ACA TGC ATG TCG 
GGT CAG TGG TTT ATG TAG TAC TGT ACG TAC AGC 



Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val 
4951 GCC GAC CTG GAG GTC GTC ACG AGC ACC TGG GTG 
CGG CTG GAC CTC CAG CAG TGC TCG TGG ACC CAC 



Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
4984 CTC GTT GGC GGC GTC CTG GCT GCT TTG GCC GCG 
GAG CAA CCG CCG CAG GAC CGA CGA AAC CGG CGC 



Tyr Cys Leu Ser Thr Gly Cys Val Val Ile Val
5017 TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG 
ATA ACG GAC AGT TGT CCG ACG CAC CAG TAT CAC 

Gly Arg Val Val Leu Ser Gly Lys Pro Ala Ile
5050 GGC AGG GTC GTC TTG TCC GGG AAG CCG GCA ATC 
CCG TCC CAG CAG AAC AGG CCC TTC GGC CGT TAG 



Ile Pro Asp Arg Glu Val Leu Tyr Arg Glu Phe
5083 ATA CCT GAC AGG GAA GTC CTC TAC CGA GAG TTC 
TAT GGA CTG TCC CTT CAG GAG ATG GCT CTC AAG 



Asp Glu Met Glu Glu Cys Ser Gln His Leu Pro
5116 GAT GAG ATG GAA GAG TGC TCT CAG CAC TTA CCG 
CTA CTC TAC CTT CTC ACG AGA GTC GTG AAT GGC



Tyr Ile Glu Gln Gly Met Met Leu Ala Glu Gln
TAC ATC GAG CAA GGG ATG ATG CTC GCC GAG CAG 
ATG TAG CTC GTT CCC TAC TAC GAG CGG CTC GTC 



Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr
TTC AAG CAG AAG GCC CTC GGC CTC CTG CAG ACC 
AAG TTC GTC TTC CGG GAG CCG GAG GAC GTC TGG 



Ala Ser Arg Gln Ala Glu Val Ile Ala Pro Ala

GCG TCC CGT CAG GCA GAG GTT ATC GCC CCT GCT 
CGC AGG GCA GTC CGT CTC CAA TAG CGG GGA CGA 



Val Gln Thr Asn Trp Gln Lys Leu Glu Thr Phe
GTC CAG ACC AAC TGG CAA AAA CTC GAG ACC TTC 
CAG GTC TGG TTG ACC GTT TTT GAG CTC TGG AAG 



Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly
TGG GCG AAG CAT ATG TGG AAC TTC ATC AGT GGG 
ACC CGC TTC GTA TAC ACC TTG AAG TAG TCA CCC 



Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro
ATA CAA TAC TTG GCG GGC TTG TCA ACG CTG CCT 
TAT GTT ATG AAC CGC CCG AAC AGT TGC GAC GGA 



Gly Asn Pro Ala Ile Ala Ser Leu Met Ala Phe
GGT AAC CCC GCC ATT GCT TCA TTG ATG GCT TTT 
CCA TTG GGG CGG TAA CGA AGT AAC TAC CGA AAA 



Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Ser 
ACA GCT GCT GTC ACC AGC CCA CTA ACC ACT AGC 
TGT CGA CGA CAG TGG TCG GGT GAT TGG TGA TCG 



Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp
CAA ACC CTC CTC TTC AAC ATA TTG GGG GGG TGG 
GTT TGG GAG GAG AAG TTG TAT AAC CCC CCC ACC



Val Ala Ala Gln Leu Ala Ala Pro Gly Ala Ala
GTG GCT GCC CAG CTC GCC GCC CCC GGT GCC GCT 
CAC CGA CGG GTC GAG CGG CGG GGG CCA CGG CGA 



Thr Ala Phe Val Gly Ala Gly Leu Ala Gly Ala 
ACT GCC TTT GTG GGC GCT GGC TTA GCT GGC GCC 
TGA CGG AAA CAC CCG CGA CCG AAT CGA CCG CGG 



Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu
GCC ATC GGC AGT GTT GGA CTG GGG AAG GTC CTC 
CGG TAG CCG TCA CAA CCT GAC CCC TTC CAG GAG 



Ile Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val
ATA GAC ATC CTT GCA GGG TAT GGC GCG GGC GTG 
TAT CTG TAG GAA CGT CCC ATA CCG CGC CCG CAC 



Ala Gly Ala Leu Val Ala Phe Lys Ile Met Ser
GCG GGA GCT CTT GTG GCA TTC AAG ATC ATG AGC 
CGC CCT CGA GAA CAC CGT AAG TTC TAG TAC TCG 



Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn 
GGT GAG GTC CCC TCC ACG GAG GAC CTG GTC AAT 
CCA CTC CAG GGG AGG TGC CTC CTG GAC CAG TTA 



Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu
CTA CTG CCC GCC ATC CTC TCG CCC GGA GCC CTC 
GAT GAC GGG CGG TAG GAG AGC GGG CCT CGG GAG 



Val Val Gly Val Val Cys Ala Ala Ile Leu Arg
GTA GTC GGC GTG GTC TGT GCA GCA ATA CTG CGC 
CAT CAG CCG CAC CAG ACA CGT CGT TAT GAC GCG 



Arg His Val Gly Pro Gly Glu Gly Ala Val Gln
CGG CAC GTT GGC CCG GGC GAG GGG GCA GTG CAG 
GCC GTG CAA CCG GGC CCG CTC CCC CGT CAC GTC 



FIG. 12-20 






Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg

5743 TGG ATG AAC CGG CTG ATA GCC TTC GCC TCC CGG 
ACC TAC TTG GCC GAC TAT CGG AAG CGG AGG GCC 



Gly Asn His Val Ser Pro Thr His Tyr Val Pro 
5776 GGG AAC CAT GTT TCC CCC ACG CAC TAC GTG CCG 
CCC TTG GTA CAA AGG GGG TGC GTG ATG CAC GGC 



Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile
5809 GAG AGC GAT GCA GCT GCC CGC GTC ACT GCC ATA 
CTC TCG CTA CGT CGA CGG GCG CAG TGA CGG TAT 



Leu Ser Ser Leu Thr Val Thr Gln Leu Leu Arg
5842 CTC AGC AGC CTC ACT GTA ACC CAG CTC CTG AGG 
GAG TCG TCG GAG TGA CAT TGG GTC GAG GAC TCC 



Arg Leu His Gln Trp Ile Ser Ser Glu Cys Thr
5875 CGA CTG CAC CAG TGG ATA AGC TCG GAG TGT ACC 
GCT GAC GTG GTC ACC TAT TCG AGC CTC ACA TGG 



Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Ile
5908 ACT CCA TGC TCC GGT TCC TGG CTA AGG GAC ATC 
TGA GGT ACG AGG CCA AGG ACC GAT TCC CTG TAG 



Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe
5941 TGG GAC TGG ATA TGC GAG GTG TTG AGC GAC TTT 
ACC CTG ACC TAT ACG CTC CAC AAC TCG CTG AAA 



Lys Thr Trp Leu Lys Ala Lys Leu Met Pro Gln
5974 AAG ACC TGG CTA AAA GCT AAG CTC ATG CCA CAG 
TTC TGG ACC GAT TTT CGA TTC GAG TAC GGT GTC 



Leu Pro Gly Ile Pro Phe Val Ser Cys Gln Arg
6007 CTG CCT GGG ATC CCC TTT GTG TCC TGC CAG CGC
GAC GGA CCC TAG GGG AAA CAC AGG ACG GTC GCG



Gly Tyr Lys Gly Val Trp Arg Val Asp Gly Ile
GGG TAT AAG GGG GTC TGG CGA GTG GAC GGC ATC 
CCC ATA TTC CCC CAG ACC GCT CAC CTG CCG TAG 



Met His Thr Arg Cys His Cys Gly Ala Glu Ile
ATG CAC ACT CGC TGC CAC TGT GGA GCT GAG ATC 
TAC GTG TGA GCG ACG GTG ACA CCT CGA CTC TAG 



Thr Gly His Val Lys Asn Gly Thr Met Arg Ile

ACT GGA CAT GTC AAA AAC GGG ACG ATG AGG ATC 
TGA CCT GTA CAG TTT TTG CCC TGC TAC TCC TAG 



Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser 
GTC GGT CCT AGG ACC TGC AGG AAC ATG TGG AGT 
CAG CCA GGA TCC TGG ACG TCC TTG TAC ACC TCA 



Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly
GGG ACC TTC CCC ATT AAT GCC TAC ACC ACG GGC 
CCC TGG AAG GGG TAA TTA CGG ATG TGG TGC CCG 



Pro Cys Thr Pro Leu Pro Ala Pro Asn Tyr Thr 
CCC TGT ACC CCC CTT CCT GCG CCG AAC TAC ACG 
GGG ACA TGG GGG GAA GGA CGC GGC TTG ATG TGC 



Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr 
TTC GCG CTA TGG AGG GTG TCT GCA GAG GAA TAT 
AAG CGC GAT ACC TCC CAC AGA CGT CTC CTT ATA 



Val Glu Ile Arg Gln Val Gly Asp Phe His Tyr
GTG GAG ATA AGG CAG GTG GGG GAC TTC CAC TAC 
CAC CTC TAT TCC GTC CAC CCC CTG AAG GTG ATG 



Val Thr Gly Met Thr Thr Asp Asn Leu Lys Cys 
GTG ACG GGT ATG ACT ACT GAC AAT CTC AAA TGC 
CAC TGC CCA TAC TGA TGA CTG TTA GAG TTT ACG



Pro Cys Gln Val Pro Ser Pro Glu Phe Phe Thr
CCG TGC CAG GTC CCA TCG CCC GAA TTT TTC ACA 
GGC ACG GTC CAG GGT AGC GGG CTT AAA AAG TGT 



Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala 

GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG 
CTT AAC CTG CCC CAC GCG GAT GTA TCC AAA CGC 



Pro Pro Cys Lys Pro Leu Leu Arg Glu Glu Val 
CCC CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA 
GGG GGG ACG TTC GGG AAC GAC GCC CTC CTC CAT 



Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 
TCA TTC AGA GTA GGA CTC CAC GAA TAC CCG GTA 
AGT AAG TCT CAT CCT GAG GTG CTT ATG GGC CAT 



Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp
GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAC 
CCC AGC GTT AAT GGA ACG CTC GGG CTT GGC CTG 



Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro 
GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC 
CAC CGG CAC AAC TGC AGG TAC GAG TGA CTA GGG 



Ser His Ile Thr Ala Glu Ala Ala Gly Arg Arg
TCC CAT ATA ACA GCA GAG GCG GCC GGG CGA AGG
AGG GTA TAT TGT CGT CTC CGC CGG CCC GCT TCC 



Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser 
TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC AGC 
AAC CGC TCC CCT AGT GGG GGG AGA CAC CGG TCG 



Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu
TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC 
AGG AGC CGA TCG GTC GAT AGG CGA GGT AGA GAG



Lys Ala Thr Cys Thr Ala Asn His Asp Ser Pro
AAG GCA ACT TGC ACC GCT AAC CAT GAC TCC CCT 
TTC CGT TGA ACG TGG CGA TTG GTA CTG AGG GGA 



Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp
GAT GCT GAG CTC ATA GAG GCC AAC CTC CTA TGG 
CTA CGA CTC GAG TAT CTC CGG TTG GAG GAT ACC 



Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val
AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT 
TCC GTC CTC TAC CCG CCG TTG TAG TGG TCC CAA 



Glu Ser Glu Asn Lys Val Val Ile Leu Asp Ser
GAG TCA GAA AAC AAA GTG GTG ATT CTG GAC TCC 
CTC AGT CTT TTG TTT CAC CAC TAA GAC CTG AGG 



Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg 
TTC GAT CCG CTT GTG GCG GAG GAG GAC GAG CGG 
AAG CTA GGC GAA CAC CGC CTC CTC CTG CTC GCC 



Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys
GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG 
CTC TAG AGG CAT GGG CGT CTT TAG GAC GCC TTC 



Ser Arg Arg Phe Ala Gln Ala Leu Pro Val Trp
TCT CGG AGA TTC GCC CAG GCC CTG CCC GTT TGG 
AGA GCC TCT AAG CGG GTC CGG GAC GGG CAA ACC 



Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu 
GCG CGG CCG GAC TAT AAC CCC CCG CTA GTG GAG 
CGC GCC GGC CTG ATA TTG GGG GGC GAT CAC CTC 



Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val 
ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG 
TGC ACC TTT TTC GGG CTG ATG CTT GGT GGA CAC



Val His Gly Cys Pro Leu Pro Pro Pro Lys Ser
GTC CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC 
CAG GTA CCG ACA GGC GAA GGT GGA GGT TTC AGG 



Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
CCT CCT GTG CCT CCG CCT CGG AAG AAG CGG ACG 
GGA GGA CAC GGA GGC GGA GCC TTC TTC GCC TGC 



Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala 

GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC 
CAC CAG GAG TGA CTT AGT TGG GAT AGA TGA CGG 



Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser 
TTG GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC 
AAC CGG CTC GAG CGG TGG TCT TCG AAA CCG TCG 



Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn Thr
TCC TCA ACT TCC GGC ATT ACG GGC GAC AAT ACG 
AGG AGT TGA AGG CCG TAA TGC CCG CTG TTA TGC 



Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys 
ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC TGC 
TGT TGT AGG AGA CTC GGG CGG GGA AGA CCG ACG 



Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser 
CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC 
GGG GGG CTG AGG CTG CGA CTC AGG ATA AGG AGG 



Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro 
ATG CCC CCC CTG GAG GGG GAG CCT GGG GAT CCG 
TAC GGG GGG GAC CTC CCC CTC GGA CCC CTA GGC 



Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser 
GAT CTT AGC GAC GGG TCA TGG TCA ACG GTC AGT 
CTA GAA TCG CTG CCC AGT ACC AGT TGC CAG TCA



Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys
AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC 
TCA CTC CGG TTG CGC CTC CTA CAG CAC ACG ACG 



Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Val 
TCA ATG TCT TAC TCT TGG ACA GGC GCA CTC GTC 
AGT TAC AGA ATG AGA ACC TGT CCG CGT GAG CAG 



Thr Pro Cys Ala Ala Glu Glu Gln Lys Leu Pro
ACC CCG TGC GCC GCG GAA GAA CAG AAA CTG CCC 
TGG GGC ACG CGG CGC CTT CTT GTC TTT GAC GGG 



Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His
ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC 
TAG TTA CGT GAT TCG TTG AGC AAC GAT GCA GTG 



His Asn Leu Val Tyr Ser Thr Thr Ser Arg Ser 
CAC AAT TTG GTG TAT TCC ACC ACC TCA CGC AGT 
GTG TTA AAC CAC ATA AGG TGG TGG AGT GCG TCA 



Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp
GCT TGC CAA AGG CAG AAG AAA GTC ACA TTT GAC 
CGA ACG GTT TCC GTC TTC TTT CAG TGT AAA CTG 



Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp
AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC 
TCT GAC GTT CAA GAC CTG TCG GTA ATG GTC CTG 



Val Leu Lys Glu Val Lys Ala Ala Ala Ser Lys 
GTA CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA 
CAT GAG TTC CTC CAA TTT CGT CGC CGC AGT TTT 



Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
GTG AAG GCT AAC TTG CTA TCC GTA GAG GAA GCT 
CAC TTC CGA TTG AAC GAT AGG CAT CTC CTT CGA



Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser
TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC
ACG TCG GAC TGC GGG GGT GTG AGT CGG TTT AGG 



Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys 
AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC 
TTC AAA CCA ATA CCC CGT TTT CTG CAG GCA ACG 



His Ala Arg Lys Ala Val Thr His Ile Asn Ser
CAT GCC AGA AAG GCC GTA ACC CAC ATC AAC TCC 
GTA CGG TCT TTC CGG CAT TGG GTG TAG TTG AGG 



Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr 
GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA 
CAC ACC TTT CTG GAA GAC CTT CTG TTA CAT TGT 



Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu
CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG 
GGT TAT CTG TGA TGG TAG TAC CGA TTC TTG CTC 



Val Phe Cys Val Gln Pro Glu Lys Gly Gly Arg
GTT TTC TGC GTT CAG CCT GAG AAG GGG GGT CGT 
CAA AAG ACG CAA GTC GGA CTC TTC CCC CCA GCA 



Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu
AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG 
TTC GGT CGA GCA GAG TAG CAC AAG GGG CTA GAC 



Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr 
GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC 
CCG CAC GCG CAC ACG CTT TTC TAC CGA AAC ATG 



Asp Val Val Thr Lys Leu Pro Leu Ala Val Met 
GAC GTG GTT ACA AAG CTC CCC TTG GCC GTG ATG 
CTG CAC CAA TGT TTC GAG GGG AAC CGG CAC TAC

Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly
7822 GGA AGC TCC TAC GGA TTC CAA TAC TCA CCA GGA 
CCT TCG AGG ATG CCT AAG GTT ATG AGT GGT CCT 



Gln Arg Val Glu Phe Leu Val Gln Ala Trp Lys
7855 CAG CGG GTT GAA TTC CTC GTG CAA GCG TGG AAG 
GTC GCC CAA CTT AAG GAG CAC GTT CGC ACC TTC 



Ser Lys Lys Thr Pro Met Gly Phe Ser Tyr Asp 
7888 TCC AAG AAA ACC CCA ATG GGG TTC TCG TAT GAT 
AGG TTC TTT TGG GGT TAC CCC AAG AGC ATA CTA 



Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser 
7921 ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC 
TGG GCG ACG AAA CTG AGG TGT CAG TGA CTC TCG 



Asp Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys
7954 GAC ATC CGT ACG GAG GAG GCA ATC TAC CAA TGT 
CTG TAG GCA TGC CTC CTC CGT TAG ATG GTT ACA 



Cys Asp Leu Asp Pro Gln Ala Arg Val Ala Ile
7987 TGT GAC CTC GAC CCC CAA GCC CGC GTG GCC ATC 
ACA CTG GAG CTG GGG GTT CGG GCG CAC CGG TAG 



Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
8020 AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC 
TTC AGG GAG TGG CTC TCC GAA ATA CAA CCC CCG 



Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly 
8053 CCT CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC 
GGA GAA TGG TTA AGT TCC CCC CTC TTG ACG CCG 



Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 
8086 TAT CGC AGG TGC CGC GCG AGC GGC GTA CTG ACA 
ATA GCG TCC ACG GCG CGC TCG CCG CAT GAC TGT 

FIG. 12-28



Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile
8119 ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 
TGA TCG ACA CCA TTG TGG GAG TGA ACG ATG TAG 



Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu 

8152 AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC 
TTC CGG GCC CGT CGG ACA GCT CGG CGT CCC GAG 

Gln Asp Cys Thr Met Leu Val Cys Gly Asp Asp
8185 CAG GAC TGC ACC ATG CTC GTG TGT GGC GAC GAC 
GTC CTG ACG TGG TAC GAG CAC ACA CCG CTG CTG 



Leu Val Val Ile Cys Glu Ser Ala Gly Val Gln
8218 TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC CAG 
AAT CAG CAA TAG ACA CTT TCG CGC CCC CAG GTC 



Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu 
8251 GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG 
CTC CTG CGC CGC TCG GAC TCT CGG AAG TGC CTC 



Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp 
8284 GCT ATG ACC AGG TAC TCC GCC CCC CCT GGG GAC 
CGA TAC TGG TCC ATG AGG CGG GGG GGA CCC CTG 



Pro Pro Gln Pro Glu Tyr Asp Leu Glu Leu Ile
8317 CCC CCA CAA CCA GAA TAC GAC TTG GAG CTC ATA 
GGG GGT GTT GGT CTT ATG CTG AAC CTC GAG TAT 



Thr Ser Cys Ser Ser Asn Val Ser Val Ala His 
8350 ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC
TGT AGT ACG AGG AGG TTG CAC AGT CAG CGG GTG 



Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr
8383 GAC GGC GCT GGA AAG AGG GTC TAC TAC CTC ACC
CTG CCG CGA CCT TTC TCC CAG ATG ATG GAG TGG

FIG. 12-29



Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala
8416 CGT GAC CCT ACA ACC CCC CTC GCG AGA GCT GCG
GCA CTG GGA TGT TGG GGG GAG CGC TCT CGA CGC



Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser
8449 TGG GAG ACA GCA AGA CAC ACT CCA GTC AAT TCC
ACC CTC TGT CGT TCT GTG TGA GGT CAG TTA AGG



Trp Leu Gly Asn Ile Ile Met Phe Ala Pro Thr
8482 TGG CTA GGC AAC ATA ATC ATG TTT GCC CCC ACA
ACC GAT CCG TTG TAT TAG TAC AAA CGG GGG TGT



Leu Trp Ala Arg Met Ile Leu Met Thr His Phe
8515 CTG TGG GCG AGG ATG ATA CTG ATG ACC CAT TTC
GAC ACC CGC TCC TAC TAT GAC TAC TGG GTA AAG



Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu
8548 TTT AGC GTC CTT ATA GCC AGG GAC CAG CTT GAA
AAA TCG CAG GAA TAT CGG TCC CTG GTC GAA CTT



Gln Ala Leu Asp Cys Glu Ile Tyr Gly Ala Cys
8581 CAG GCC CTC GAT TGC GAG ATC TAC GGG GCC TGC
GTC CGG GAG CTA ACG CTC TAG ATG CCC CGG ACG



Tyr Ser Ile Glu Pro Leu Asp Leu Pro Pro Ile
8614 TAC TCC ATA GAA CCA CTT GAT CTA CCT CCA ATC
ATG AGG TAT CTT GGT GAA CTA GAT GGA GGT TAG



Ile Gln Arg Leu His Gly Leu Ser Ala Phe Ser
8647 ATT CAA AGA CTC CAT GGC CTC AGC GCA TTT TCA
TAA GTT TCT GAG GTA CCG GAG TCG CGT AAA AGT



Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg
8680 CTC CAC AGT TAC TCT CCA GGT GAA ATT AAT AGG
GAG GTG TCA ATG AGA GGT CCA CTT TAA TTA TCC

FIG. 12-30



Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro
8713 GTG GCC GCA TGC CTC AGA AAA CTT GGG GTA CCG
CAC CGG CGT ACG GAG TCT TTT GAA CCC CAT GGC



Pro Leu Arg Ala Trp Arg His Arg Ala Arg Ser
8746 CCC TTG CGA GCT TGG AGA CAC CGG GCC CGG AGC
GGG AAC GCT CGA ACC TCT GTG GCC CGG GCC TCG



Val Arg Ala Arg Leu Leu Ala Arg Gly Gly Arg
8779 GTC CGC GCT AGG CTT CTG GCC AGA GGA GGC AGG
CAG GCG CGA TCC GAA GAC CGG TCT CCT CCG TCC



Ala Ala Ile Cys Gly Lys Tyr Leu Phe Asn Trp
8812 GCT GCC ATA TGT GGC AAG TAC CTC TTC AAC TGG
CGA CGG TAT ACA CCG TTC ATG GAG AAG TTG ACC



Ala Val Arg Thr Lys Leu Lys
8845 GCA GTA AGA ACA AAG CTC AAA C
CGT CAT TCT TGT TTC GAG TTT G



FIG. 12-31 
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59/ 7 9 primer J159S 

J1 ACTGCCCTGAACTGCAATGA
PT G

Ser Leu Lys Thr Gly Phe Leu Ala Ala
J1 1 C TCC CTC AAA ACT GGG TTT CTT GCC GCG
PT T AG C C C GG T G A G
Asn Trp Gly

Leu Phe Tyr Thr His Lys Phe Asn Ala
J1 29 CTG TTC TAC ACA CAC AAG TTC AAC GCG
PT T T CAC T T
His Ser
primer 166A for J1-1216

Ser Gly Cys Pro Glu Arg Met Ala Ser
J1 56 TCC GGA TGC CCG GAG CGC ATG GCC AGC
PT ACTT AGCA
Leu

Cys Arg Ser Ile Asp Lys Phe
J1 83 TGT CGC TCC ATT GAC AAG TTC
PT C A C C AC G T T
Pro Leu Thr Asp

Gly Trp Gly Pro Ile Thr Tyr Ala Gln
J1 110 GGA TGG GGT CCC ATC ACC TAT GCT CAA
PT C C T GT C A C
Ser Asn

Pro Asp Asn Ser Asp Gln Arg Pro Tyr
J1 137 CCT GAC AAC TCG GAC CAG AGG CCG TAT
PT GGA AG GG C C C C C C
Gly Ser Gly Pro

Cys Trp His Tyr Ala Pro Arg Gln Cys
J1 164 TGC TGG CAC TAC GCA CCT CGA CAG TGT
PT C C A AA CT C
Pro Lys Pro
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Jl 

Gly Ile Val Pro Ala Ser Gln Val Cys
J1 191 GGT ATC GTA CCC GCG TCG CAG GTG TGC
PT T G AA AGT T
Lys Ser

Gly Pro Val Tyr Cys Phe Thr Pro Ser
J1 218 GGT CCA GTG TAT TGC TTC ACC CCA AGC
PT G A T C

Pro Val Val Val Gly Thr Thr Asp Arg
J1 245 CCT GTT GTA GTG GGG ACG ACC GAT CGT
PT C G G A C A G

Phe Gly Ala Pro Thr Tyr Asn Trp Gly
J1 272 TTC GGC GCC CCT ACG TAT AAC TGG GGG
PT CG G C C C G T
Ser Ser

Asp Asn Glu Thr Asp Val Leu Leu Leu
J1 299 GAC AAT GAG ACG GAC GTG CTG CTC CTA
PT A T C T C G T
Glu Asp Phe Val

Asn Asn Thr Arg Pro Pro His Gly Asn
J1 326 AAC AAC ACG CGG CCC CCG CAC GGC AAC
PT T C A A TG T
Leu

Trp Phe Gly Cys Thr
J1 353 TGG TTC GGC TGT ACA
PT T CTGGATGAACTCAACTGGATT

primer 199A 

Nucleotide Match: 259/367 (70.6%) 

Amino Acid Match (stringent): 93/122 (76.2%) 

(relaxed): 111/122 (91.0%) 

FIG. 13-2 
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Prototype HCV (PT) sequences different from 
Japanese HCV (J1) are shown.

Relaxed amino acid match: Gly=Ala=Pro=Ser=Thr,
Asp=Glu, Asn=Gln,
Arg=Lys=His, Leu=Ile=Val=Met, Phe=Trp=Tyr.
Underline: different amino acid in relaxed matching.
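
The stringent and relaxed amino acid match counts defined by the legend can be sketched in a few lines of Python. This is only an illustration and not the procedure used to prepare the figure: the function names (`relaxed_equal`, `match_counts`, `percent`) and the five-residue toy sequences are hypothetical, residues are written as three-letter codes, and the equivalence groups are copied from the legend, reading its fourth group as Arg=Lys=His.

```python
# Equivalence groups from the legend: residues in the same group count as
# a match under "relaxed" matching. Residues in no group (e.g. Cys) only
# match themselves.
RELAXED_GROUPS = [
    {"Gly", "Ala", "Pro", "Ser", "Thr"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Arg", "Lys", "His"},
    {"Leu", "Ile", "Val", "Met"},
    {"Phe", "Trp", "Tyr"},
]


def relaxed_equal(a, b):
    """Return True if residues a and b are identical or share a relaxed group."""
    if a == b:
        return True
    return any(a in group and b in group for group in RELAXED_GROUPS)


def match_counts(seq_a, seq_b):
    """Return (stringent matches, relaxed matches, aligned length).

    Both inputs are equal-length lists of three-letter residue codes that
    are assumed to be already aligned position by position.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    stringent = sum(a == b for a, b in zip(seq_a, seq_b))
    relaxed = sum(relaxed_equal(a, b) for a, b in zip(seq_a, seq_b))
    return stringent, relaxed, len(seq_a)


def percent(matches, total):
    """Express matches/total as a percentage rounded to one decimal place."""
    return round(100.0 * matches / total, 1)


# Hypothetical toy alignment (not taken from the figures).
toy_a = ["Gly", "Leu", "Asp", "Lys", "Cys"]
toy_b = ["Ala", "Leu", "Glu", "Ser", "Cys"]
stringent, relaxed, total = match_counts(toy_a, toy_b)
print(stringent, relaxed, total)
print(percent(stringent, total), percent(relaxed, total))
```

Applied to the counts reported for FIG. 13, `percent(259, 367)`, `percent(93, 122)` and `percent(111, 122)` give 70.6, 76.2 and 91.0.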
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Core to NS1 vs. HCV-1 

Pro Leu Val
J1 1 T CCG CTC GTC
HCV-1 A

Gly Ala Pro Leu Gly Gly Ala Ala Arg
J1 11 GGC GCC CCC TTA GGG GGC GCT GCC AGG

Ala Leu Ala His Gly Val Arg Val Leu
J1 38 GCC CTG GCA CAT GGT GTC CGG GTT CTG

Glu Asp Gly Val Asn Tyr Ala Thr Gly
J1 65 GAG GAC GGC GTG AAC TAT GCA ACA GGG

Asn Leu Pro Gly Cys Ser Phe Ser Ile
J1 92 AAT TTG CCC GGT TGC TCT TTC TCT ATC
--C C-T --T

Phe Leu Leu Ala Leu Leu Ser Cys Leu
J1 119 TTC CTC TTG GCT CTG CTG TCC TGT TTG

Thr Ile Pro Ala Ser Ala Tyr Glu Val
J1 146 ACC ATC CCA GCT TCC GCT TAT GAA GTG
--T G-G --C --G --C --C C--
Val Gln

Arg Asn Val Ser Gly Ile Tyr His Val
J1 173 CGC AAC GTG TCC GGG ATA TAC CAT GTC
TCC A-G C-T --C
Ser Thr Leu

Thr Asn Asp Cys Ser Asn Ser Ser Ile
J1 200 ACA AAC GAC TGC TCC AAC TCA AGC ATT
--C --T --T C-T --G --T
Pro
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Val Tyr Glu Ala Ala Asp Val lie Met 



J1 227 GTG TAT GAG GCG GCG GAC
--C --C --T
Ala Leu

His Ala Pro Gly Cys Val Pro Cys Val
J1 254 CAT GCC CCC GGG TGC GTG CCC TGC GTT
--C A-T --G --C --T
Thr

Arg Glu Asn Asn Ser Ser Arg Cys Trp
J1 281 CGG GAG AAC AAT TCC TCC CGT TGC TGG
--T GG- --C G-- --G A-G --T
Gly Ala

Val Ala Leu Thr Pro Thr Leu Ala Ala
J1 308 GTA GCG CTC ACT CCC Avis CTC GCG GCC
--A A-G --C --T G-G --C A--
Met Val Thr

Arg Asn Ala Ser Val Pro Thr Thr Thr
J1 335 AGG AAT GCC AGC GTC CCC ACT ACG ACA
G-- -G- -AA C-- G-G CAG
Asp Gly Lys Leu Ala Gln

Leu Arg Arg His Val Asp Leu Leu Val
J1 362 TTA CGA CGC CAC GTC GAC TTG CTC GTT
C-T C T --C
Ile

Gly Thr Ala Ala Phe Cys Ser Ala Met
J1 389 GGG ACG GCT GCT TTC TGC TCC GCT ATG
-GC --C A-C C-- --T --G --C C-C
Ser Thr Leu Leu

Tyr Val Gly Asp Leu Cys Gly Ser Val
J1 416 TAC GTG GGG GAT CTC TGC GGA TCT GTT
--C --A --G C

WO 91/04262 PCT/US90/05242
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Phe Leu lie Ser Gin Leu Phe Thr Phe 

J1 443 TTC CTC ATC TCC CAG CTG TTC ACC TTC
--T --T G-- GG- --A
Val Gly

Ser Pro Arg Arg His Glu Thr Val Gln
J1 470 TCG CCT CGC CGG CAT GAG ACA GTA CAG
--T --C A-G --C --C TG- --G ACG --A
Trp Thr

Asp Cys Asn Cys Ser Ile Tyr Pro Gly
J1 497 GAC TGC AAC TGC TCA ATC TAT CCC GGC
-GT --T --T
Gly

His Val Ser Gly His Arg Met Ala Trp
J1 524 CAC GTA TCA GGC CAT CGC ATG GCT TGG
--T A-- A-G --T --C A
Ile Thr

Asp Met Met Met Asn Trp Ser Pro Thr
J1 551 GAT ATG ATG ATG AAC TGG TCG CCC ACG
C --T

Ala Ala Leu Val Val Ser Gln Leu Leu
J1 578 GCA GCC TTA GTG GTG TCG CAG TTA CTC
A-G --G --G --A A-- G-T C-G
Thr Met Ala

Arg Ile Pro Gln Ala Val Met Asp Met
J1 605 CGG ATC CCA CAA GCT GTC ATG GAC ATG
C A-- T
Ile Leu

Val Ala Gly Ala His Trp Gly Val Leu
J1 632 GTG GCG GGG GCC CAC TGG GGA GTC CTA
A-C --T --T --T --G
Ile

FIG. 14-3 
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Ala Gly Leu Ala Tyr Tyr Ser Met Val 

J1 659 GCG GGC CTT GCC TAC TAT TCC ATG GTG
A-A --G --T -TC
Ile Phe

Gly Asn Trp Ala Lys Val Leu Ile Val
J1 686 GGG AAC TGG GCT AAG GTT TTG ATT GTG
G C C-- G-A
Val

Met Leu Leu Phe Ala Gly Val Asp Gly
J1 713 ATG CTA CTC TTT GCC GGC GTT GAC GGG
C G --A C -C-
Leu Ala

His Thr Arg Val Thr Gly Gly Val Gln
J1 740 CAT ACC CGC GTG ACG GGG GGG GTG CAA
G-A -A- --C --C --A AGT GCC
Glu His Ser Ala

Gly His Val Thr Ser Thr Leu Thr Ser
J1 767 GGC CAC GTC ACC TCT ACA CTC ACG TCC
ACT GTG GGA T-T GTT AG-
Thr Val Gly Phe Val

Leu Phe Arg Pro Gly Ala Ser Gln Lys
J1 794 CTC TTT AGA CCT GGG GCG TCC CAG AAA
C-C GC- --A --C --C AAG C-- --C
Leu Ala Lys Asn

Ile Gln Leu Val Asn Thr Asn Gly Ser
J1 821 ATT CAG CTT GTA AAC ACC AAT GGC AGT
G-C --G A-C --C
Val Ile

Trp His Ile Asn Arg Thr Ala Leu Asn
J1 848 TGG CAT ATC AAC AGG ACT GCC CTG AAC
Ser
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Asn Asp Ser 


Leu Gln Thr Gly Phe
J1 875 TGC AAT GAC TCC CTC CAA ACT GGG TTC
--T AG- A-C --C --C -GG
Asn Trp

Leu Ala Ala
J1 902 CTT GCC GCG CTG TTC TAC ACA CAC AAG
T-G CAC C--
Gly His

J1 929 TTC AAC GCG TCC GGA TGC CCG GAG CGC
A-G
Ser

Ser Ile Asp Lys
J1 956 ATG GCC AGC TGT CGC TCC ATT GAC AAG
C-A --C --A C-- C-- AC- G-T
Leu Pro Leu Thr Asp

Phe Asp Gln Gly Trp Gly Pro Ile Thr
J1 983 TTC GAC CAG GGA TGG GGT CCC ATC ACC
--T --C --C --T -GT
Ser

Tyr Ala Gln Pro Asp Asn Ser Asp Gln
J1 1010 TAT GCT CAA CCT GAC AAC TCG GAC CAG
C AAC GGA AGC GG- C-C
Asn Gly Ser Gly Pro

Arg Pro Tyr Cys Trp His Tyr Ala Pro
J1 1037 AGG CCG TAT TGC TGG CAC TAC GCA CCT
C-C --C --C C-C --A
Pro

Arg Gln Cys Gly Ile Val Pro Ala Ser
J1 1064 CGA CAG TGT GGT ATC GTA CCC GCG TCG
AA- -CT --C --T --G AA-
Lys Pro Lys
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Gin Val Cys Gly Pro Val Tyr Cys Phe 
J1 1091 CAG GTG TGC GGT CCA GTG TAT TGC TTC
AGT T G --A
Ser

Thr Pro Ser Pro Val Val Val Gly Thr
J1 1118 ACC CCA AGC CCT GTT GTA GTG GGG ACG
--T --C --C --G --G --A

Thr Asp Arg Phe Gly Ala Pro Thr Tyr
J1 1145 ACC GAT CGT TTC GGC GCC CCT ACG TAT
A-G -CG G --C --C --C
Ser

Asn Trp Gly Asp Asn Glu Thr Asp Val
J1 1172 AAC TGG GGG GAC AAT GAG ACG GAC GTG
-G T --A T C
Ser Glu Asp

Leu Leu Leu Asn Asn Thr Arg Pro Pro
J1 1199 CTG CTC CTA AAC AAC ACG CGG CCC CCG
T-C G-- --T --T --C A-- --A
Phe Val

His Gly Asn Trp Phe Gly Cys Thr
J1 1226 CAC GGC AAC TGG TTC GGC TGT ACA
-TG --T --T --
Leu

FIG. 14-6 
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Gly Asn Trp Phe Gly Cys Thr Trp Met 
J1 1 TG GGC AAC TGG TTC GGC TGT ACA TGG ATG
HCV-1 T T C

Asn Ser Thr Gly Phe Thr Lys Thr Cys
J1 30 AAT AGC ACT GGG TTC ACC AAG ACG TGC
HCV-1 --C TCA A A GT
Val

Gly Gly Pro Pro Cys Asn Ile Gly Gly
J1 57 GGA GGC CCC CCG TGT AAC ATC GGG GGG
HCV-1 -CG --T --T GT- --A
Val

Val Gly Asn Asn Thr Leu Thr Cys Pro
J1 84 GTC GGC AAC AAC ACC TTG ACC TGC CCC
HCV-1 -CG C-- CA
Ala His

Thr Asp Cys Phe Arg Lys Thr Pro Thr
J1 111 ACG GAC TGC TTC CGG AAG ACC CCG ACG
HCV-1 T --T --C CAT GAC
His Asp

Ala Thr Tyr Thr Lys Cys Gly Ser Gly
J1 138 GCC ACT TAC ACA AAA TGT GGT TCG GGC
HCV-1 --A T-T CGG --C --C --C --T
Ser Arg

Pro Trp Leu Thr Pro Arg Cys Leu Val
J1 165 CCT TGG TTG ACA CCT AGG TGC TTG GTT
HCV-1 --C A-C C C C
Ile

Asp Tyr Pro Tyr Arg Leu Trp His Tyr
J1 192 GAC TAC CCA TAC AGG CTC TGG CAC TAC
HCV-1 --G --T --T --T --T
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Pro Cys Thr Val As.n Phe Thr lie Phe 
J1 219 CCC TGC ACT GTC AAC TTT ACC ATC TTC
HCV-1 --T --T --C A AC A --T
Ile Tyr

Lys Val Arg Met Tyr Val Gly Gly Val
J1 246 AAG GTT AGG ATG TAT GTG GGG GGC GTG
HCV-1 --A A-C C A --G --C
Ile

Glu His
J1 273 GAG CAC
HCV-1 --A

FIG. 15-2 
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C200 region sequence vs. HCV-1 



Asn Met Ser
C200 3799 AAT ATG TCC
HCV-1 3781 ACA CTG GGC TTT GGT GCT T-C
Thr Leu Gly Phe Gly Ala Tyr

Lys Ala His Gly Thr Asp Pro Asn Ile
C200 3808 AAG GCA CAT GGC ACC GAC CCC AAC ATC
HCV-1 T G -T- --T --T
Ile

Arg Thr Gly Val Arg Thr Ile Thr Thr
C200 3835 AGA ACT GGG GTA AGG ACC ATC ACC ACA
HCV-1 --G --C G --A --A --T T

Gly Ala Pro Ile Thr Tyr Ser Thr Tyr
C200 3862 GGT GCC CCC ATT ACG TAC TCC ACC TAT
HCV-1 --C AG C C
Ser

Arg Lys Phe Leu Ala Asp Gly Gly Cys
C200 3889 CGC AAG TTC CTT GCC GAC GGT GGT TGC
HCV-1 G C G
Gly

Ser Gly Gly Ala Tyr Asp Ile Ile
C200 3916 TCC GGG GGC GCC TAT GAC ATC ATA A
HCV-1 G --T --A -TT
Ile

HCV-1 3943 TGT GAC GAG TGC CAC TCC ACG GAT GCC
Cys Asp Glu Cys His Ser Thr Asp Ala

HCV-1 3970 ACA TCC ATC TTG GGC ATC GGC ACT GTC
Thr Ser Ile Leu Gly Ile Gly Thr Val

FIG. 16-1 
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HCV-l 3997 CTT GAC CAA GCA GAG ACT GCG GGG GCG 
Leu Asp Gln Ala Glu Thr Ala Gly Ala

HCV-1 4024 AGA CTG GTT GTG CTC GCC ACC GCC ACC
Arg Leu Val Val Leu Ala Thr Ala Thr

HCV-1 4051 CCT CCG GGC TCC GTC ACT GTG CCC CAT
Pro Pro Gly Ser Val Thr Val Pro His

HCV-1 4078 CCC AAC ATC GAG GAG GTT GCT CTG TCC
Pro Asn Ile Glu Glu Val Ala Leu Ser

HCV-1 4105 ACC ACC GGA GAG ATC CCT TTT TAC GGC
Thr Thr Gly Glu Ile Pro Phe Tyr Gly

Ser Ile Pro Ile Glu Ala Ile Lys
C200 4132 A AGC ATC CCC ATC GAG GCC ATC AAG
HCV-1 AAG GCT C A -TA
Lys Ala Val

Gly Gly Arg His Leu Ile Phe Cys His
C200 4159 GGG GGA AGG CAT CTC ATC TTC TGC CAT
HCV-1 G --A T

Ser Lys Lys Lys Cys Asp Glu Leu Ala
C200 4186 TCC AAG AAG AAG TGT GAC GAG CTC GCC
HCV-1 --A C A

Ala Lys Leu Ser Ala Leu Gly Leu Asn
C200 4213 GCA AAG CTG TCA GCC CTC GGA CTC AAT
HCV-1 GTC --A T-G --C A--
Val Ile

Ala Val Ala Tyr Tyr Arg Gly Leu Asp
C200 4240 GCC GTG GCG TAT TAC CGC GGT CTT GAT
HCV-1 C --C C

FIG. 16-2 
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Val Ser Val lie Pro Thr Ser Gly Asp 

C200 4267 GTG TCC GTC ATA CCA ACT AGC GGA GAC
HCV-1 C --G --C --C --T

Val Val Val Val Ala Thr Asp
C200 4294 GTC GTT GTC GTG GCA ACA GAC GC 4316
HCV-1 --T --C --C --T --C CTC

HCV-1 4321 ATG ACC GGC TAT ACC GGC GAC TTC GAC
Met Thr Gly Tyr Thr Gly Asp Phe Asp

HCV-1 4348 TCG GTG ATA GAC TGC AAT ACG TGT GTC
Ser Val Ile Asp Cys Asn Thr Cys Val

HCV-1 4375 ACC CAG ACA GTC GAT TTC AGC CTT GAC
Thr Gln Thr Val Asp Phe Ser Leu Asp

HCV-1 4402 CCT ACC TTC ACC ATT GAG ACA ATC ACG
Pro Thr Phe Thr Ile Glu Thr Ile Thr

HCV-1 4429 CTC CCC CAG GAT GCT GTC TCC CGC ACT
Leu Pro Gln Asp Ala Val Ser Arg Thr

HCV-1 4456 CAA CGT CGG GGC AGG ACT GGC AGG GGG
Gln Arg Arg Gly Arg Thr Gly Arg Gly

HCV-1 4483 AAG CCA GGC ATC TAC AGA TTT GTG GCA
Lys Pro Gly Ile Tyr Arg Phe Val Ala

HCV-1 4510 CCG GGG GAG CGC CCC TCC GGC ATG TTC
Pro Gly Glu Arg Pro Ser Gly Met Phe

HCV-1 4537 GAC TCG TCC GTC CTC TGT GAG TGC TAT
Asp Ser Ser Val Leu Cys Glu Cys Tyr

FIG. 16-3 
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HCV-l 4564 GAC GCA GGC TGT GCT TGG TAT GAG CTC 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu

HCV-1 4591 ACG CCC GCC GAG ACT ACA GTT AGG CTA
Thr Pro Ala Glu Thr Thr Val Arg Leu

HCV-1 4618 CGA GCG TAC ATG AAC ACC CCG GGG CTT
Arg Ala Tyr Met Asn Thr Pro Gly Leu

HCV-1 4645 CCC GTG TGC CAG GAC CAT CTT GAA TTT
Pro Val Cys Gln Asp His Leu Glu Phe

HCV-1 4672 TGG GAG GGC GTC TTT ACA GGC CTC ACT
Trp Glu Gly Val Phe Thr Gly Leu Thr

HCV-1 4699 CAT ATA GAT GCC CAC TTT CTA TCC CAG
His Ile Asp Ala His Phe Leu Ser Gln

HCV-1 4726 ACA AAG CAG AGT GGG GAG AAC CTT CCT
Thr Lys Gln Ser Gly Glu Asn Leu Pro

HCV-1 4753 TAC CTG GTA GCG TAC CAA GCC ACC GTG
Tyr Leu Val Ala Tyr Gln Ala Thr Val

HCV-1 4780 TGC GCT AGG GCT CAA GCC CCT CCC CCA
Cys Ala Arg Ala Gln Ala Pro Pro Pro

HCV-1 4807 TCG TGG GAC CAG ATG TGG AAG TGT TTG
Ser Trp Asp Gln Met Trp Lys Cys Leu

HCV-1 4834 ATT CGC CTC AAG CCC ACC CTC CAT GGG
Ile Arg Leu Lys Pro Thr Leu His Gly

HCV-1 4861 CCA ACA CCC CTG CTA TAC AGA CTG GGC
Pro Thr Pro Leu Leu Tyr Arg Leu Gly

FIG. 16-4 
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HCV-l 4888 GCT GTT CAG AAT GAA ATC ACC CTG ACG 
Ala Val Gln Asn Glu Ile Thr Leu Thr

HCV-1 4915 CAC CCA GTC ACC AAA TAC ATC ATG ACA
His Pro Val Thr Lys Tyr Ile Met Thr

HCV-1 4942 TGC ATG TCG GCC GAC CTG GAG GTC GTC
Cys Met Ser Ala Asp Leu Glu Val Val

HCV-1 4969 ACG AGC ACC TGG GTG CTC GTT GGC GGC
Thr Ser Thr Trp Val Leu Val Gly Gly

HCV-1 4996 GTC CTG GCT GCT TTG GCC GCG TAT TGC
Val Leu Ala Ala Leu Ala Ala Tyr Cys

HCV-1 5023 CTG TCA ACA GGC TGC GTG GTC ATA GTG
Leu Ser Thr Gly Cys Val Val Ile Val

HCV-1 5050 GGC AGG GTC GTC TTG TCC GGG AAG CCG
Gly Arg Val Val Leu Ser Gly Lys Pro

Glu Val Leu
C200 GAA GTC CTC
HCV-1 5077 GCA ATC ATA CCT GAC AGG
Ala Ile Ile Pro Asp Arg

Tyr Arg Glu Phe Asp Glu Met Glu Glu
C200 5104 TAC CGA GAG TTC GAT GAG ATG GAA GAG
HCV-1

Cys Ala Ser His Leu Pro Tyr Ile Glu
C200 5131 TGC GCC TCA CAC CTC CCC TAC ATC GAA
HCV-1 T-T CAG T-A --G --G
Ser Gln

FIG. 16-5 
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i 

Gln Gly Met Gln Leu Ala Glu Gln Phe
C200 5158 CAG GGA ATG CAG CTC GCC GAG CAA TTC
HCV-1 --A --G AT- --G
Met

Lys Gln Lys Ala Leu Gly Leu Leu Gln
C200 5185 AAG CAG AAG GCG CTC GGG TTG CTG CAA
HCV-1 --C --C C-C --G

Thr Ala Thr Lys Gln Ala Glu Ala Ala
C200 5212 ACA GCC ACC AAG CAA GCG GAG GCT GCT
HCV-1 --C --G T-- CGT --G --A -T- ATC
Ser Arg Val Ile

Ala Pro Cys Glu Ser Met His Ala Ser
C200 5239 GCT CCG TGT GAG TCA ATG CAC GCC TCG
HCV-1 --C --T GC- -TC CAG -CC A-- TGG CAA
Ala Val Gln Thr Asn Trp Gln

C200 5266 A
HCV-1 -AA CTC GAG ACC TTC TGG GCG AAG CAT
Lys Leu Glu Thr Phe Trp Ala Lys His

HCV-1 5293 ATG TGG AAC TTC ATC AGT GGG ATA CAA TA
Met Trp Asn Phe Ile Ser Gly Ile Gln

FIG. 16-6 
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NS1 Sequence vs.. HCV-1 

Leu Gly Asn Trp Phe Gly Cys Thr Trp
J1 1 G TTG GGC AAT TGG TTC GGT TGC ACC TGG
HCV-1 - C T

Met Asn Ser Ser Gly Phe Thr Lys Val
J1 29 ATG AAC TCA TCT GGA TTT ACC AAA GTG
HCV-1 A C
Thr

Cys Gly Ala Pro Pro Cys Val Ile Gly
J1 56 TGC GGA GCG CCT CCT TGT GTC ATC GGA
HCV-1
Ala

Gly Val Gly Asn Asn Thr Leu Gln Cys
J1 83 GGG GTG GGC AAC AAC ACC TTG CAA TGC
HCV-1 C -- C-- --C
Ala His

Pro Thr Asp Cys Phe Arg Lys His Pro
J1 110 CCC ACT GAC TGT TTC CGC AAG CAT CCG
HCV-1 T

Asp Ala Thr Tyr Ser Arg Cys Gly Ser
J1 137 GAC GCC ACA TAC TCT CGG TGC GGT TCC
HCV-1 C

Gly Pro Trp Ile Thr Pro Arg Cys Leu
J1 164 GGT CCC TGG ATT ACG CCC AGG TGC CTG
HCV-1 C --A

Val His Tyr Pro Tyr Arg Leu Trp His
J1 191 GTC CAC TAC CCT TAT AGG CTT TGG CAT
HCV-1 G G
Asp

Tyr Pro Cys Thr Val Asn Tyr Thr Leu
J1 218 TAT CCC TGT ACT GTC AAC TAC ACC TTG
HCV-1 T -- --C A A-A
Ile Ile

FIG. 17-1
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Phe Lys Val Arg Met Tyr Val Gly Gly 
J1 245 TTC AAA GTC AGG ATG TAC GTG GGA GGG
HCV-1 --T A
Ile

Val Glu His Arg Leu Glu Val Ala Cys
J1 272 GTC GAG CAC AGG CTG GAA GTT GCT TGC
HCV-1 A C C
Ala

Asn Trp Thr Arg Gly Glu Arg Cys Asp
J1 299 AAC TGG ACG CGG GGC GAG CGT TGT GAT
HCV-1 A C

Leu Asp Asp Arg Asp
J1 326 CTG GAC GAC AGG GAC A
HCV-1 A
Glu

FIG. 17-2 




Core Sequence vs. HCV-1 

J1 1 GCGTCTAGCCATGGCGTTAGTATGAGTGTC
HCV-1

J1 31 GTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCC
HCV-1

J1 66 ATAGTGGTCTGCGGAACCGGTGAGTACACCGGAAT
HCV-1

J1 101 TGCCAGGACGACCGGGTCCTTTCTTGGATCAACCC
HCV-1

J1 136 GCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCGA
HCV-1 A

J1 171 GACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGC
HCV-1

J1 206 CTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGC
HCV-1

Met Ser
J1 241 CCCGGGAGGTCTCGTAGACCGTGCATCATG AGC
HCV-1 C

Thr Asn Pro Lys Pro Gln Arg Lys Thr
J1 274 ACA AAT CCT AAA CCT CAA AGA AAA ACC
HCV-1 --G A- -A-
Lys Asn

Lys Arg Asn Thr Asn Arg Arg Pro Gln
J1 301 AAA CGT AAC ACC AAC CGC CGC CCA CAG
HCV-1 T

Asp Val Lys Phe Pro Gly Gly Gly Gln
J1 328 GAC GTC AAG TTC CCG GGC GGT GGT CAG
HCV-1 T --C

Ile Val Gly Gly Val Tyr Leu Leu Pro
J1 355 ATC GTT GGT GGA GTT TAC CTG TTG CCG
HCV-1 T

Arg Arg Gly Pro Arg Leu Gly Val Arg
J1 382 CGC AGG GGC CCC AGG TTG GGT GTG CGC
HCV-1 --T --A

FIG. 18-1
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Ala Thr Arg Lys Thr Ser Glu Arg Ser 
J1 409 GCG ACT AGG AAG ACT TCC GAG CGG TCG
HCV-1

Gln Pro Arg Gly Arg Arg Gln Pro Ile
J1 436 CAA CCT CGT GGA AGG CGA CAA CCT ATC
HCV-1

Pro Lys Ala Arg Gln Pro Glu Gly Arg
J1 463 CCC AAG GCT CGC CAG CCC GAG GGC AGG
HCV-1
Arg

Ala Trp Ala Gln Pro Gly Tyr Pro Trp
J1 490 GCC TGG GCT CAG CCC GGG TAC CCT TGG
HCV-1
Thr

Pro Leu Tyr Gly Asn Glu Gly Met Gly
J1 517 CCC CTC TAT GGC AAC GAG GGC ATG GGG
HCV-1
Cys

Trp Ala Gly Trp Leu
J1 544 TGG GCA GGA TGG CTC CT
HCV-1




FIG. 18-2 
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